Introduction to Matrices 


Definition 


Q The trace tr(^4) of an n X n matrix A = (a^) is the sum of the diagonal entries of A. That is, 
tr(A) = a a ■ 

The transpose ^4 T of an m X n matrix A is the n X m matrix whose (i, j) -entry is . 

A matrix A is called symmetric if A T = A. 

We say two matrices A, B commute if AB = BA. 

The entries an of a matrix A = (Aij) are called diagonal entries. 

A diagonal matrix is a square matrix whose non-diagonal entries are all zero. 

Ann X n matrix whose diagonal entries are all 1 is called the identity matrix and denoted by I n , or 
simply by I. 


Summary 


For matrices A, B, and a scalar r, the followings are true when the expressions defined: 

1. (A + B) t = A t + B t 

2. (. AB) t = B t A t 

3. (r^4) T = tv1 t 

For n X n matrices A, B, and a scalar r, we have 

1. tr(A + B) = tr(^) + tr(B) 

2. tr(rvl) = rtr(vl) 


Problems 

Let A and B are matrices such that the matrix product AB is defined and AB is a square matrix. Is it true that 
the matrix product BA is also defined and BA is a square matrix? If it is true, then prove it. If not, find a 
counterexample. 

Defmition/Hint. 

Let ibeanmxn matrix. 

This means that the matrix A has m rows and n columns. 

Let B be an r X s matrix. 

Then the matrix product AB is defined if n — r, that is, if the number of columns of A is equal to 
the number of rows of B. 

















Definition. A matrix C is called a square matrix if the size of C is n X n for some positive integer 
n. 

(The number of rows and the number of columns are the same.) 

Proof. 

We prove that the matrix product BA is defined and it is a square matrix. 

Let A be an m X n matrix and B be an r X s matrix. 

Since the matrix product AB is defined, we must have n — r and the size of AB is m X s. 

Since AB is a square matrix, we have m — s. 

Thus the size of the matrix B is n X m. 

From this, we see that the product BA is defined and its size is n X n, hence it is a square matrix. 


Let A and B be 2 X 2 matrices. Prove or find a counterexample for the statement that 

(A-B)(A + B) — A 2 — B 2 . 

Hint. 

In general, matrix multiplication is not commutative: AB and BA might be different. 

Solution. 

Let us calculate ( A — B)(A + B) as follows using the fact that the matrix product is distributive. 

(A - B)(A + B) = A(A + B)~ B(A + B ) 

= A 2 +AB- BA- B 2 
= A 2 -B 2 + (.AB - BA). 

Thus if (A — B) [A + B) = A 2 — B 2 then AB — BA — O, the zero matrix. Equivalently, 
AB = BA. 

Note that matrix multiplication is not commutative, namely, AB BA in general. 

Thus we can disprove the statement if we find matrices A and B such that AB BA. 

For example, let 



'1 

r 


o r 

A = 



and B — 


0 

0 


0 1 


Then we have 



'0 

2' 


'0 

o ' 

AB = 


and BA = 


.0 

0 . 


.0 

0 . 


Since AB BA , we have ( A — B)(A + B) A 2 — B 2 for 










Hence we found a counterexample for the statement. 


Let A and B be n X n matrices. Suppose that the matrix product AB = O, where O is the n X n zero matrix. 
Is it true that the matrix product with opposite order BA is also the zero matrix? If so, give a proof. If not, give a 
counterexample. 


Solution. 


The statement is in general not true. We give a counter example. 
Consider the following 2x2 matrices. 


Then we compute 



"0 1 ■ 


"1 1 ' 

A = 

0 1 

and 

0 0 


■o 

>i 

r 1 

1 " 


"0 

0 “ 

0 


[° 

0 


0 

0 


Thus the matrix product AB is the 2x2 zero matrix O. 


On the other hand, we compute 



■i 11 ro 

1 ■ 


■0 

2 " 

BA = 

o o J [o 

1 

— 

0 

0 


Thus the matrix product BA is not the zero matrix. 
Therefore the statement is not true in general. 


Find a nonzero 3x3 matrix A such that A 2 ^ O and A 3 


= O, where O is the 3 X 3 zero matrix. 


Solution. 


For example, let A be the following 3x3 matrix. 


A = 


■0 1 
0 0 
0 0 


o- 

1 

0 


Then A is a nonzero matrix and we have 





















■o 

1 

0 ■ 


■o 

1 

O' 


■o 

0 

1 1 

a 2 = 

0 

0 

1 


0 

0 

1 

= 

0 

0 

0 


0 

0 

0 


0 

0 

0 


0 

0 

0 


The third power of A is 



■o 

0 

1 1 


■o 

1 

0 ■ 


■o 

0 

0- 

A 3 = A 2 A = 

0 

0 

0 


0 

0 

1 

= 

0 

0 

0 


0 

0 

0 


0 

0 

0 


0 

0 

0 


Thus, the nonzero matrix A satisfies the required conditions A ^ O, A = O. 


Comment. 

A square matrix A is called nilpotent if there is a non-negative integer k such that A k is the zero 
matrix. 

The smallest such an integer k is called degree or index of A. 


The matrix A in the solution above gives an example of a 3 x 3 nilpotent matrix of degree 3. 


0 Let 



2 

-1 


and u = 


1 

0 


Compute A 2017 u. 

(The Ohio State University) 


Solution. 


We first compute Aw. We have 



r-i 2 I 

r 1 1 

■-1 ■ 


■1 ■ 

Au = 


= 


— _ 


[° -u 

M 

0 


0 


Thus we have 


Au = - u. 


Then using this result repeatedly we have 

A 2 u = A(Au) =A(- u) = -Au= - ( - u) = u. 


Thus we have 


^4 2 u = u. 


Next, we compute 























A 3 u = A(A 2 u) = A(u) = - u. 


Now you see the pattern and obtain 


A n u 


-u if^isodd 
u if fz is even. 


(To prove this pattern, use mathematical induction. See below.) 
Thus we conclude that 


A 2017 u = 


-1 

0 


since 2017 is odd. 

The proof of the formula. 

For completeness, we prove the pattern (*) we see above. 

We use mathematical induction. 

When n = 1, we computed Au = - u and the base case is true. 
Suppose that(*) is true for n = k. 

Then we have 


u if k + 1 is even 

-u if k+ 1 is odd 


A k+l n=A{A k M) = 


A( - u) nk is odd 
^(u) if k is even 


-Au if k is odd 
Au if k is even 


u if k is odd 

-u if k is even 


Here the second equality follows from the induction hypothesis. 

This proves (*) for n = k + 1 and this completes the induction step and (*) is true for any positive 
integer n. 


Let 


A = 


i i r 
o o 1 
o o 1 


be a 3 X 3 matrix. Then find the formula for A n for any positive integer n. 

Proof. 

We first compute several powers of A and guess the general formula. 
We have 






"i i r 


'i i r 


'1 1 3' 

0 0 1 


0 0 1 

= 

0 0 1 

.0 0 1. 


.0 0 1. 


.0 0 1. 



"1 1 3' 


"1 1 

r 


"1 1 5' 

A s = A 2 A = 

0 0 1 
.0 0 1. 


0 0 
.0 0 

1 

1. 


0 0 1 
.0 0 1. 


A 4 = A S A = 

"1 1 5' 

0 0 1 


"1 1 

0 0 

r 

1 

_ 

"1 1 7' 

0 0 1 


0 0 1 


0 0 

1 


0 0 1 


From these computations, we guess the general formula of A n is 


A n 


1 1 2n-l 

0 0 1 
0 0 1 


We prove this formula by mathematical induction on n. 
The base case n — 1 follows from the definition of A. 


Suppose that the formula is true for n — k. 
We prove the formula for n = k + 1. 

We have 


A k+1 = A k A 

1 1 


0 

0 

1 

0 

0 

1 

0 

0 


2k- 1 
1 
1 

2k + 1 
1 
1 

2{k + 1) - 1 

1 

1 


by the induction hypothesis 


Thus the formula holds for n — k + 1. 

Flence the formula is true for any positive integer n by induction. 


A square matrix A is called idempotent if A 2 — A. 

(a) Suppose A is an n X n idempotent matrix and let / be the n X n identity matrix. Prove that the matrix I — A 
is an idempotent matrix. 

(b) Assume that A is an n x n nonzero idempotent matrix. Then determine all integers k such that the matrix 
I — kA is idempotent. 

(c) Let A and B be n X n matrices satisfying AB — A and BA — B. Then prove that A is an idempotent 
matrix. 




























Proof. 

(a) Prove that the matrix I — A is an idempotent matrix. 

To prove that I — A is an idempotent matrix, we show that [I — A) 2 — I — A. 

We compute 

(I-A) 2 = (I-A)(I-A) 

= 1(1 -A)- A(I - A) 

= 1- A- A + A 2 
= I - 2 A + A 2 

— I — 2A + A since A is idempotent 

= 1-A. 

Thus, we have (/ — A) 2 — I — A and I — A is an idempotent matrix. 


(b) Determine all integers k such that the matrix / — kA is idempotent. 


Let us find out the condition on k so that I — kA is an idempotent matrix. 
We have 


(I-kA) 2 = (I - kA) (I - kA) 

= 1(1 - kA) - kA(I - kA) 

— I - kA — kA + k 2 A 2 

= I — 2kA + k 2 A since A is idempotent 

— I — (2k — k 2 )A. 

It follows that I — kA is idempotent if and only if I — kA = I — (2k — k 2 )A, or equivalently 
(k 2 — k)A = O, the zero matrix. 

Since A is not the zero matrix, we see that / — kl is idempotent if and only if A: 2 — k — 0. 

Since k 2 — k — k(k — 1) , we conclude that I — kA is an idempotent matrix if and only if 

A: = 0,1. 

(c) Prove that A is an idempotent matrix. 

Let A and B be n X n matrices satisfying 

AB = A (*) 

and 


BA = B. 




Our goal is to show that A 2 — A. 
We compute 


A 2 = AA 


= ( AB)A 

by (*) 

= A(BA) 


= AB 

by (**) 

— A 

by (*)• 


Therefore, we have obtained the identity A 2 = A, and we conclude that A is an idempotent matrix. 


Let A — 


1 3 

2 4 


(a) Find all matrices B — 


x y 
z w 


such that AB = BA. 


(b) Use the results of part (a) to exhibit 2x2 matrices B and C such that AB = BA and AC ^ C A. 

Solution. 


Find all matrices B such that AB = BA 


Since we want to find B such that AB = BA , we need to find x,y,z,w satisfying 


1 

h- 1 

co 


*2 


i 

R 

*2 


'1 3' 

.2 4. 


9 


i 

U 

6 


.2 4. 


Comparing entries, we have the following system of linear equations. 


x + 3z — x + 2y 
y + 3w — 3x + Ay 
2x + Az — z + 2w 
2y + Aw — 3z + Aw. 


Simplifying this, we obtain 


2y — 3z = 0 
3x + 3y — 3w = 0 
2x + 3z — 2w = 0 
2y — 3z — 0. 

Note that the first and the last equations are identical. So we omit the first equation from our 
consideration. 

To solve this system, we use the Gauss-Jordan elimination method. Namely we form the augmented 
matrix of this system and apply elementary row operations as follows. 


3 

3 

0 

-3 

0 " 

jRi 

" 1 

1 

0 

-1 

0 " 

2 

0 

3 

-2 

0 

-* 

2 

0 

3 

-2 

0 

0 

2 

-3 

0 

0 


0 

2 

-3 

0 

0 


1 

1 

0 

-1 

0 " 

R\ + J i?2 

i?2 +R 3 

" 1 

0 

3/2 

-1 

0 ' 

0 

-2 

3 

0 

0 


0 

-2 

3 

0 

0 

? 

0 

2 

-3 

0 

0 


0 

0 

0 

0 

0 


" 1 

0 

3/2 

-1 

0 " 

0 

1 

-3/2 

0 

0 

0 

0 

0 

0 

0 


The last matrix is in reduced row echelon form. From this, we obtain the general solution 

































3 

X — —-Z + w 

Li 

3 

y = -z, 


where z and w are free variables. 

Therefore the matrix B such that AB = BA must be 


-f Z + W | z 


z w 



- _ 3 

3 " 


'i 

o' 

z 

2 

2 

+ w 



. 1 

0. 


.0 

1 . 


for any z and w. 

(b) Find matrices S, C such that AB = BA , AC ^ CA 
By part (a), if 



3. 

3 - 


'i 

o' 

B = z 

2 

2 

+ w 



1 

0_ 


.0 

1. 


for some z, w , then we have AB — BA. 

For example, let z — 2 ,w — 0. Then the matrix 


B = 



3 

0 


satisfies AB — BA. 

To find a matrix C such that AC C A, the matrix C must not be of the form of the formula of B. 

For example, let 


c = 


0 

1 


0 

0 


You may directly check that AC CA. 

Or, we can show that C is never be the matrix of the form 

’-f -z + w | z 
z w _ 

To see this, compare (1,2) and (2,1) entries. There is no z such that 

3 

— z — 0 and z — 1. 

2 

(This is how I found the matrix C as above.) 


















Let 

0 
1 
0 


A = 


12 3 
4 5 6 


,B = 


10 1 
0 1 0 


,C = 


1 2 
0 6 


Then compute and simplify the following expression. 

v T (A t -(A- B) t ) C. 

Solution. 

(b) Matrix product and transpose 


First of all, note that by the property of the transpose we have 

(A - B) t = A t - B t . 

Hence we can simply the middle part: 

A T -{A- B) t =A t - ( A t - B t ) 

= A t -A t + b t = b t . 


Thus the expression becomes 


v T (a t -(A- b ) t ) c = v T n T C. 

The transposes of v and B are 


V T = [0 1 0] andS T 


1 0 
0 1 
1 0 


Thus we have 


v T (A t - (A 
= [0 1 0 ] 


- B) t ) C = v t B t C 


1 

0 

1 

2 ' 

6 . 



'1 

2 


.0 

6 . 


= [0 6 ]. 


In conclusion, we have obtained 


v T (A T - (A - B) T ) C = [0 6]. 


Let v = [ 2 —5 — 1 ]. Find all 3 X 1 column vectors w such that vw = 0. 

Solution. 


Let w = 


w i 
w 2 

Ws 























Then we want 


vw = [ 2 -5 -11 


w i 
w 2 
Ws J 


= 2wi-5w 2 -ws — 0 . 


Letting W\ and w 2 be free variables, we solve W 3 — 2w\— 5w 2 . 
Then every solution to the equation vw = 0 is of the form 



W\ 




' 0 ' 

w = 

w 2 

_ 2 wi 5w 2 _ 

= W\ 

0 

_2_ 

+ W 2 

1 

.-5. 


Prove the following identity for any positive integer n. 


COS# 

— sin# 

n 

cos n# 

— sin n# 

sin# 

cos# 


sin n# 

cos n# 


Hint. 


Recall the addition theorem of trigonometric functions (sum formula) 


sin(x + y) — sin x cos y + cos x sin y 
cos (x + y) — cos x cos y — sin x sin y. 


Use the induction on n. 


Proof. 


We prove the identity by induction on n. 
The base case n — 1 is clear. 

Suppose that the identity is true for n = k. 
Then we have 


cos 0 

— sin# 

k+1 

sin# 

cos# 


cos 9 

— sin# 


COS# 

sin# 

cos# 


sin# 

cos# 

— sin# 


cos k9 

sin# 

cos# 


sin kO 


— sin# 
cos# 

— sin kO 
cos k6 

(by the induction hypothesis) 

cos # cos k9- sin # sin kO — cos # sin kO — sin # cos k9 
sin # cos kO + cos # sin kO — sin # sin kO + cos # cos kO 
(by matrix multiplication) 

" cos(# + kO) — sin(# + k6 )" 

_ sin(# + kO ) cos(# + kO ) 

(by the addition theorem of trigonometric functions) 

"cos (k + 1)9 — sin(fc + 1)# 

_sin(fc + 1)# cos(fc + 1)# 






























This proves that the identity holds for n = k + 1. 

Thus by induction, the identity is true for all positive integers n. 


Let A and Bben X n matrices. Is it always true that tv(AB) = tr(^4)tr(.B)? If it is true, prove it. If not, give 
a counterexample. 

Solution. 


There are many counterexamples. 
For one, take 


'1 o' 


'0 O' 

and B = 

.0 o. 


.0 1. 


Then tr(4) 


AB = 


0 

0 


= l,tr(.B) = 1, and hence tr(j4)tr(5) 

O' 

0 


1, while tr(j45) = 0 as 


Let A be an n X n matrix. Is it true that tr(A T ) = tr(A)? If it is true, prove it. If not, give a counterexample. 

Solution. 

The answer is true. Recall that the transpose of a matrix is the sum of its diagonal entries. Also, note 
that the diagonal entries of the transposed matrix are the same as the original matrix. 

Putting together these observations yields the equality tr(^4 T ) = tr(-A). 


Here is the more formal proof. 

For A = (cLij)i<ij<n> the transpose A T = (bij)i<ij< n is defined by b{j — 
In particular, notice that bu — an for 1 < i < n. And so, 

n n 

tr(A T ) = frit = y^gg = tr(^4). 

1=1 1=1 


Let v and w be two n X 1 column vectors. 

(a) Prove that v T w = w T v. 

rp rp 

(b) Provide an example to show that vw is not always equal to wv . 


Solution. 

(a) Prove that v T w = w T v. 


Suppose the vectors have component 










~Vi " 


W\ 

V = 

V2 

and w = 

w 2 


- v n_ 


- w n_ 


Then, 


V W = [v\ V 2 


while 


w i 
w 2 

W n 


Vl 

V2 


= y^ y vjWi 

i =1 


W V = ^ 1^2 




= y ^wjVj. 


i=1 


We can see that they are equal because V{ = WiV{. 


(b) Provide an example to show that v w T is not always equal to w v T . 

rii 

For the counterexample, let v = and w = 


. Then 


T 

vw = 


[o i] = 


while 


wv 


1 0] 


0 1 
0 0 


0 0 
1 0 


Comment. 

Recall that for two vectors v, w E R n , the dot product (or inner product) of v, w is defined to be 


v • w := v w. 


Part (a) of the problem deduces that the dot product is commutative. This means that we have 


v • w = w • V. 


In fact, we have 


T ( a ) T 

V • W = V w = w vw • V. 


Also, notice that while vw 1 is not always equal to wv 1 , we know that (vw T ) T = wv 


Let v and w be two n X 1 column vectors. Prove that tr(vw T ) = v T w. 






















Solution. 

Suppose the vectors have components 


Then, 


We can now see that 



Vi 


W\ 

V = 

V2 

and w = 

W2 


- v n_ 


- w n_ 


vw 


Vi 

V2 

L v nj 


[W 1 W 2 




V\W\ 

VIW2 • 

* * Vl W n 

V2W\ 

V 2 W 2 • 

* * V2 W n 

_V n Wi 

V n W 2 * 

* * V n W n 


tr(vw T ) = ^ 1 


V{Wi = V w. 


i =1 


Comment. 


Recall that v 1 w is, by definition, the dot product of the vectors v and w. 


So, the dot product of vectors v and w is equal to the trace of the matrix vw 


Let v be an n X 1 column vector. Prove that v T v = 0 if and only if v is the zero vector 0. 

Proof. 


Let v = 


Vi 

V2 


. Wi 
Then we have 


V V = [Vi V2 


Vi 

V2 

V n J 


£v?- 


i —1 


Because each vf is non-negative, this sum is 0 if and only if Vi =0 for each i. In this case, v is the 
zero vector. 


Comment. 














Recall that the the length of the vector v £ is defined to be 


11v11 := v / v T v. 

The problem implies that the length of a vector is 0 if and only if the vector is the zero vector. 


G Is it true that a real square matrix A must commute with its transpose A T ? 

Solution. 

(a) Is it true that ,4 must commute with its transpose? 

The answer is no. 

We give a counterexample. Let 


Then the transpose of A is 




A t = 


1 0 

-1 2 


We compute 


AA t = 


■1 

_1 1 

r 1 

0 ■ 


■ 2 

-2 ■ 

0 

2 J 

[- 

2 


-2 

4 


and 


A t A = 


' 1 

°i 

n -li 


■ 1 

-1 " 

-1 


[O 2 


-1 

5 


Therefore, we see that 


AA t + A t A , 


that is, A does not commute with its transpose A 


G 


Let v be an n X 1 column vector. Prove that vv is a symmetric matrix. 

Definition (Symmetric Matrix). 

A matrix A is called symmetric if A T — A. 

In terms of entries, an n X n matrix A — (a^) is symmetric if 


— dji for all 1 < i, j < n. 


Proof. 















Let v = 


Vi 

V2 


. Then we have 



Vi 



V\V\ 

V\V 2 • 

•• ViV n 

T 

V2 



V 2 Vi 

v 2 v 2 • 

* * V 2 V n 

VV = 

- v n_ 

[vi V 2 ■ 

■ • v n ] = 

_V n Vl 

v n v 2 • 

•• v n v n _ 


In particular, the the (i, j)-th component is 

(w T )ii = ViVj - VjVi - (vv T ) rJ . 


T 

This shows that the matrix vv is symmetric. 


Calculate the following expressions, using the following matrices: 


A = 


2 3 
-5 1 


B = 


0 -1 

1 -1 


2 

-4 


(a) AB t + vv T . 

(b) Av-2v. 

(c )v T B. 

(d) v T v + v t BA t v. 


Solution. 


(a) AB t + vv T 








AB t + vv T = 

' 2 
.-5 

3 

1. 

' 0 
.-1 

1 

-1 

+ 

N 1, 

[2 -4] 


'-3 

-1 


' 4 

—8" 




.-1 

-6. 

+ 

.-8 

16. 




' 1 

-9' 







.-9 

10. 






(b) Av- 2v 



2 

3' 

' 2 ' 


' 2 ' 


^4v-2v = 




-2 





.-5 

1. 

.-4. 


.-4. 




' -8 

- 

'-4' 





— 

.-14 

+ 

. 8 . 





-12 

-6 


(c)v T n 







































(d) v T v + v T BA T w 
v T v + w T BA 
= [2 -4] 

= 18+ [-4 

= 18-92 
= -74 


w T B = [2 -4] 

— [ —4 2 ] 


v 



0 -1 

1 -1 



S> Let A and B be n X n matrices, and v an n x 1 column vector. Use the matrix components to prove that 
(+ + B)v = Aw + Bw. 

Solution. 

We will use the matrix components A = {dij)i<i,j<n, B = and v = (vi )i<i< n . 


Then 


(A + B)w = 


£"=1 (« 1 3 + bi i) v i 

£"=1 («2 j + b 2j )Vj 
£"=1 («3 j + hj)Vj 


£j=l ( a nj + b n j)vj 


On the other hand, 



Lj=i °i j v j 


’£"= i 


’£J=i(«ij + M u j" 


Lj=l a 2j v j 


£"=1 b 2m 


£”=l(«2i + &2j) w J 

Aw + Bv — 

Lj=l a 3j v j 

+ 

£"=i b vvj 

= 

£”=l(«3j + &3j) u J 


•sr^n 

_ L^j= 1 a nj v j _ 


_ £j=1 b nj v j - 


_£j=l ( a nj + b nj) v j " 


We can see that they are the same. 


© 


Let A be a square matrix such that A T A = A , where A T is the transpose matrix of A. Prove that A is 
idempotent, that is, A 2 = A. Also, prove that A is a symmetric matrix. 























Recall the basic properties of transpose matrices. 
For matrices A, B, we have 


© ( AB) t = b t a t . 
®A TT = A. 


Proof. 

We first prove that A is a symmetric matrix. 
We have 


A t = ( A t A) t 

= A t A' 1 ' 1 by property 1 
— A v A by property 2 

= A. 

Hence we obtained A l T — A, and thus A is a symmetric matrix. 


Now we prove that A is idempotent. 
We compute 


A 2 = AA 

= A T A since A is symmetric 

— A by assumption. 


Therefore, the matrix A satisfies A 2 = A , and hence it is idempotent. 


© Let A and B be n X n real symmetric matrices. Prove the followings. 

(a) The product AB is symmetric if and only if AB — BA. 

(b) If the product AB is a diagonal matrix, then AB — BA. 

Hint. 

A matrix A is called symmetric if A = A T . 


In this problem, we need the following property of transpose: 
Let A be an m x n and B be an n x r matrix. Then we have 


(AB) t = B t A t . 

(When you distribute transpose over the product of two matrices, then you need to reverse the order 
the matrix product.) 


Solution. 



(a) The product^# is symmetric if and only if AB = BA. 

Suppose AB is symmetric. This means that we have 

AB = (AB) 1 = B 1 A 1 = BA. 

The second equality is a general property of transpose. In the last equality we used the assumption 

T T 

that A, B are symmetric, that is, A = A ,B = B . Thus we have AB = BA. 

On the other hand, if AB = BA, then we have 

AB = BA = B 1 A 1 = (AB) 1 . 

Thus, we have AB = {AB) , hence AB is symmetric. 

If the products is a diagonal matrix, then AB = BA. 

Note that a diagonal matrix is symmetric. Hence the result follows from part (a). 


$A2 X 2 matrix has two parallel columns and tr {A) — 5. Find tr(A 2 ). 

Solution. 

For (a) and (b), we give two solutions. The first one does not use the knowledge of eigenvalues, and 
the second one uses eigenvalues. 

Solution 1 of (a) A 2 x 2 matrix^ satisfies tr (A 2 ) = 5 and tr (A) = 3. Find det (A). 

Let 


Then we have 


A 


"a b ‘ 

c d 


" a b I 

Ta b' 


a 2 + be 

ab + bd 

c d I 

[c d 


ac + cd 

be + d 2 


Since ix{A 2 ) = 5 and ix{A) = 3, we obtain 

5 = trf4 2 ) = (« a 2 + be) + {be + d 2 ) = a 1 + 2 be + d 2 and 
3 = tr {A) = a + d. 


We find the determinant det {A) = ad - be as follows. 
We have 


det {A) = ad - be = 


2 ^ {a + d) 2 — {a 2 + 2 be + d 2 ) 



5)-2. 








Thus, we obtain det (A) = 2. 
Solution 2 of (a) 

Let A l and k 2 be eigenvalues of A. 
Then we have 


3 = tr (A) = A 1 +A 2 and 
5 = tr(A 2 ) = Aj + A 


Here we used two facts. 

The first one is that the trace of a matrix is the sum of all eigenvalues of the matrix. 

The second one is that k is an eigenvalue of A if X is an eigenvalue of A , and these are all the 

o 

eigenvalues of A . 

Since the determinant of A is the product of eigenvalues of A, we have 

det (A) = k 1 k 2 

= \{{h + h) 2 -tf\ + A)) 

i , 

= 2 (3 5) 

= 2 . 


Hence we have det (A) = 2. 

Solution 1 of (b) A 2 x 2 matrix has two parallel columns and tr(v4) = 5. Find tr(^ 2 ). 
Since two columns are parallel, we can write A as 


Then we have 


A = 


a ra 
c rc 


5 = tr (A) = a + rc. 


We use the formula in Solution 1 of (a) for trf4 2 ) with b = ra and d = rc , and we compute 

tr(v4 2 ) = a 1 + 2 (ra) + (rc) 2 
= (a + rc) 2 
= 5 2 = 25. 


Thus, we find trf4 2 ) = 25. 
Solution 2 of (b) 





Since two columns are parallel, the matrix A is singular. Hence A has an eigenvalue 0. 

Since the sum of all the eigenvalues is tr(^4) = 5, we see that 0 and 5 are eigenvalues ofv4. 

It follows that 0 and 25 are eigenvalues of A 2 . Hence 

tr (A 2 ) = 0 + 25 = 25. 

Solution of (c) A 2 x 2 matrix A has det (A) = 5 and positive integer eigenvalues. What is the 
trace of A? 

The product of eigenvalues of A is the determinant det (A) = 5. 

Since eigenvalues are positive integers, it follows that 1 and 5 are eigenvalues ofv4. 

It follows that 


tr (A) =1+5 = 6. 


d Let I be the n X n identity matrix, where n is a positive integer. Prove that there are non X n matrices X and Y 
such that XY-YX = I. 

Hint. 

Suppose that such matrices exist and consider the trace of the matrix XY - YX. 

Recall that the trace of a square matrix A is the sum of the diagonal entries of A. 

Proof. 

We use the following basic properties of the trace of matrices. 

Let A, B be n x n matrices and let c be a scalar. 

$ tr (A + B) = tr (A) + tr(B). 

€> tr (cA) = ctr(A). 

0 tr (AB) = tr (BA). 


Seeking a contradiction, we assume that there are matrices X and Y such that XY - YX= I. 

Then we take the trace of both sides and obtain 

n = tr(7) 

= tr {XY- YX) 

= tr (XY) - tr (YX) (by property (1), (2) of the trace) 

= tr (XY) - tr (YX) (by property (3) of the trace) 

= 0 . 


Since n is a positive integer, this is a contradiction. 
Therefore, such matrices X , Y do not exist. 


Let A = (ciij) and B = ( [bij ) be n X n real matrices for some n E N. 

(a) Express tr(^4f? T ) in terms of the entries of the matrices ^4 and B. Here B T is the transpose matrix of B. 

(b) Show that tr(^4^4 T ) is the sum of the square of the entries of A. 

(c) Show that if A is nonzero symmetric matrix, then tr(zl 2 ) > 0. 

Hint. 


Review 

the definition of the transpose of a matrix 


the definition of matrix multiplication 


the definition of a symmetric matrix 


Then the proofs of these statement is straightforward computations. 

Proof. 

(a) Express tr ( AB T ) in terms of the entries of A and B. 

Here we use the following notation for an entry of a matrix: the (i, j )-entry of a matrix C is denoted 

b y 

Then the (i,j )-entry of AB T is (AB T )ij — 'Y^k=i a ikbjk- 
Thus we have 

tr(^lB T ) = Y^(AB t ) u = aikbik- 

1=1 1=1 k= 1 

(b) Show that tr ( AA T ) is the sum of the square of the entries of A 
By the formula obtained in part (a), we have 

tr(A4 T ) = ±t<- 

1=1 k=1 

This is the sum of the squares of entries of A. 

(c) Show that if A is nonzero symmetric matrix, then tr ( A 2 ) > 0. 

Since A is a symmetric matrix, we have A T — A. 

Thus by the result of part (b), we have 


tr (A 2 ) = tr(>L4 T ) = tt<> 0. 

1=1 k=1 

The last sum is strictly positive since A is not the zero matrix, there is a nonzero entry of A (and of 
course the square of a real number is nonnegative). 

Comment. 

The results we proved in this article can be extended to complex matrices, matrices with complex 
number entries. 

In this case, the condition in (c) that A is symmetric is replaced by the condition that A is a hermitian 
matrix. 


For each of the following matrix A, prove that x T Ax > 0 for all vectors x in IR 2 . Also, determine those vectors 
xGl 2 such that x T Tx = 0. 

4 2' 

2 1 

2 r 

1 3. ' 

Proof. 


(a) A = 
(b )A = 


(a) A = 


4 

2 


2 

1 


Letx = 


be a vector in R . Then we have 


x T ix = [x y] 

'4 2' 


X 


.2 1. 


.y. 


= x(4x + 2 y) + y(2x + y) = 4a; 2 
= 4x 2 + 4 xy + y 2 
= (2x + y) 2 > 0. 


4a; + 2 y 
.2 x + y _ 

+ 2 xy + 2 xy + y 1 


Thus we have x T 4x > 0. 

Note that x T ^4x = 0 if and only if 2a: + y — 0. 

Thus those vectors x such that x T 4x = 0 are 

1 

-2 



for any real number x. 


(b )A = 


2 

1 


1 

3 


Letx = 


be a vector in ] 


We compute 


























x t Ax 


[x y] 



X 


2 x + y 
x + 3y 


— x(2x + y) + y(x + 3 y) — 2x 2 + xy + xy + 3 y 2 

— 2x 2 + 2 xy + 3 y 2 

— x 2 + (x + y) 2 + 2y 2 > 0. 


(*) 


Thus we obtain x T ^4x > 0. 

It follows from (*) that x T Ax = 0 if and only if 

x 2 + (x + y) 2 + 2 y 2 — 0. 

Since each of x 2 , (x + y) 2 ,2y 2 is nonnegative, we must have x — y — 0. 
Therefore x T ^4x = 0 if and only if x = 0. 




(a) Prove that the matrix A — 
matrix B such that B 2 = A. 


0 1 
0 0 


does not have a square root. Namely, show that there is no complex 


(b) Prove that the 2 X 2 identity matrix I has infinitely many distinct square root matrices. 


Proof. 

(a) The matrix A does not have a square root. 


Let B = 


a 

c 


Since 


b 

d 


be the matrix such that B 2 = A. 


a b 

a b 


a 2 + be 

ab + bd 


a 2 + be (a + d)b 

c d 

c d 


ca + dc 

cb + d 2 _ 


_ (a + d)c cb + d 2 


it follows from B 2 = A that 


a 2 + be (a + d)b 


'0 1' 

_ (a + d)c cb + d 2 


O 

O 

_i 


Comparing entries we obtain four equations 


a 2 + be — 0 

(1) 

(a + d)b = 1 

(2) 

(a + d)c — 0 

(3) 

cb + d 2 — 0. 

(4) 


Equation (3) gives a + d = 0orc = 0. 

If a + d — 0, then equation (2) becomes 0 = 1. This is impossible and thus c = 0. 
Since c = 0, equations (1) and (4) yield that a — d = 0. 

However, inserting these into (2) gives 0 = 1. 

Hence there is no solution satisfying these equations. 

Therefore, there is no square root of the matrix A. 
























(b) The identity matrix has infinitely many square roots 


Let 


B = 


1 

0 


r 


-1 


5 


where r be an arbitrary real numbers. 


Then we compute directly and obtain 


1 r 


1 r 


'1 o' 

1 

0 

1 

I—* 

1 _ 


1 

i — 1 

1 

0 

_ 1 


.0 1. 


Hence B is a square root of the identity matrix I. 

Since r is an arbitrary real numbers, there are infinitely many square roots of I. 




Let D — 


di 

0 


0 

d,2 


0 

0 


be a diagonal matrix with distinct diagonal entries: di 7 ^ dj if i 7 ^ j. Let 


_ 0 0 ... d n _ 

A — (cLij) be an n X n matrix such that A commutes with D, that is, AD — DA. Then prove that A is a 
diagonal matrix. 


Proof. 

We prove that the (i, j) -entry of A is aij — 0 for i 7 ^ j. 


We compare the (i, j) -entries of both sides of AD — DA. 
Let D — ( dij ). That is, da — di and d\j — 0 if i 7 ^ j. 
The ntry of AD is 


n 

(AD)ij — ^ ^ Q'ikd'kj — CLijdj. 
k =1 


The (i,j) -entry of DA is 


n 

{DA)ij = ^ ^ djkCLkj — di&ij. 

k =1 


Hence we have 


Q>ij dj — dij di . 


Or equivalently we have 


dij {dj di ) — 0 . 


Since di 7 ^ dj , we have dj — di 7 ^ 0. 
Thus, we must have dij — 0 for i 7 ^ j. 

Hence A — (dij) is a diagonal matrix. 
















Elementary Row Operations 


Definition 


Let A be an m X n matrix. 

The following three operations on rows of a matrix are called elementary row operations. 

1. Interchanging two rows: 

Ri Rj interchanges rows i and j. 

2. Multiplying a row by a non-zero scalar: 

tRi multiplies row i by the non-zero scalar (number) t . 

3. Adding a multiple of one row to another row: 

Rj + tR{ adds t times row i to row j. 

Q Two matrices are row equivalent if one can be obtained from the other by a sequence of elementary 
row operations. 

The matrix in reduced row echelon form that is row equivalent to A is denoted by rref( A ). 

The rank of a matrix A is the number of rows in rref( A ). 


Problems 

For each of the following matrices, find a row-equivalent matrix which is in reduced row echelon form. Then 
determine the rank of each matrix. 

1 3" 


(a )A = 
(b) B = 


-2 2 

2 6 


-2 


3-2 8 


'2 -2 4 

(c) C= 4 1 -2 

6-12 
L —2 

(d) D = 3 

_ 1 

(e) E — [ —2 3 1]. 


Definition (Rank of a Matrix). 

The rank of a matrix A is the number of nonzero rows in the reduced row echelon form matrix rreff4) 
that is row equivalent to A. 


Solution. 


(a) A 


1 3 

-2 2 


The matrix A has rank 2, which can be seen by computing 


















■ 1 

3 " 

R 2 +2R l 

■1 

3 " 

S R 2 

"I 

3 " 

R i — 3i?2 

■1 

0 - 

-2 

2 


0 

8 


0 

1 


0 

1 


Because the row-reduced matrix has two non-zero rows, the rank of A is 2. 


(b)5 = 


2 6-2 

3-2 8 


The matrix B has rank 2, which can be seen by computing 

1 


■2 6 -2" 

2 R \ 

"1 

3 

-1 " 

R 2 -3R i 

"1 3 -f 

3-2 8 


3 

-2 

8 


0 -11 11 


( c ) c = 


2-2 4 

4 1 -2 

6-12 


"1 

3 

-1 " 

R { -3R 2 

"1 

0 2 “ 

0 

1 

-1 


0 

1 -1 








The matrix C has rank 2, which can be seen by computing 


2-2 4 1 

4 1 -2 


2^1 


1 -1 2 
4 1 -2 


r i -i 2 

0 5 -10 



6 

-1 

2 

6 


1 2 

R 

3-6*! 0 

5 

-10 

r 3 r 2 

-1 

-1 

2 ' 

i 

5 R 2 

1 1 

-1 

2 ' 

R l +R 2 

'I 

0 

o - 


0 

5 

-10 


0 

1 

-2 


0 

1 

-2 


0 

0 

0 


0 

0 

0 


0 

0 

0 


(d )D = 


-2 

3 

1 


The matrix D has rank 1, which can be seen by calculating: 


"-2 1 

-i 

— R \ 

"1 1 

*2-3*i 

"1 ' 

3 

-► 

3 


0 

1 


1 


0 


(e)£= [- 


-2 3 1 


The matrix E has rank 1, which can be seen by calculating: 

-l 


2 “1 


*1 r- 


■-2 3 r 


-3 -1 

1 2 2 














































Let A and / be 2 X 2 matrices defined as follows. 

1 0 " 
o 1 . ’ 

Prove that the matrix A is row equivalent to the matrix / if d — cb ^ 0 . 

Elementary row operations 

We review the elementary row operations before the proof. 

The three elementary row operations on a matrix are defined as follows. 

Q Interchanging two rows: 

Ri O Rj interchanges rows i and j. 


Multiplying a row by a non-zero scalar (a number): 
tRi multiplies row i by the non-zero scalar (number) t . 


Adding a multiple of one row to another row: 
Rj + tRi adds t times row i to row j. 



A = 


1 b 
c d 


I = 


Proof. 

Suppose that b — cd ^ 0. Then we can obtain the matrix I from the matrix A by the following 
sequence of elementary row operations. 

First, we apply i ?2 — cR\ and get 


'1 b 

i?2 —cRi 

1 

b 

c d 

-» 

0 

d — cb 


The next step is R 2 and get 


n 

b 


n 

b 

1 

0 

d — cb _ 

- > 

1 

0 

1 . 


Note that this is where we need the assumption d — cb 7 ^ 0 since d — cb is in the denominator. 
The last step is Ri — fei? 2 , and we obtain 


n 

b 

Ri —bR2 

n 

O' 

1 

0 

1. 

- > 

1 

0 

1. 


In summary, we have the following sequence of elementary row operations from the matrix A to the 
matrix I 

I. 
























A = 


'1 b 

f?2 ~cR\ 

1 

b 

c d_ 

- > 

1 

o 

d — cb _ 



n 

b' 

R\ —bR^ 

l 

o' 

- > 

o 

1 

-> 

O 

l 


= 1 , 


and hence A and I are row equivalent. 


If A, B have the same rank, can we conclude that they are row-equivalent? If so, then prove it. If not, then provide 
a counterexample. 

Solution. 

Having the same rank does not mean they are row-equivalent. 

For a simple counterexample, consider A — [ 1 0 ] and B — [ 0 1 ]. 

Both of these matrices have rank 1, but are not row-equivalent because they are already in reduced 
row echelon form. 

Another solution. 

The problem doesn’t specify the sizes of matrices A , B. 

Note that if the sizes of A and B are distinct, then they can never be row-equivalent. 

Keeping this in mind, let us consider the following two matrices. 

Lo 

Then both matrices are in reduced row echelon form and have rank 1. 

As noted above, they are not row-equivalent because the sizes are distinct. 


and B — [ 1 0 ] • 




o 


Find the rank of the following real matrix. 


a 1 2 

1 1 1 
-1 1 1 — a 


where a is a real number. 
(Kyoto University) 


Solution. 

The rank is the number of nonzero rows of a (reduced) row echelon form matrix of the given matrix. 
We apply elementary row operations as follows. 

















a 1 

2 


'11 1 



"1 1 






Sa 

to 

1 

e 


i i 

1 

- > 

a 1 2 


- > 

0 1 - a 

.-i i 

1 — a. 


.-1 1 1- 

a. 

R 3 +R 1 

.0 2 


"1 1 

1 

7-) 1 a j-y 

'1 

1 

1 

R 2 



Rs - 2 ~ R 2 




- > 

0 2 

2 — a 

- : — > 

0 

2 

2 — a 


.0 1 - 

a 2 — a_ 


_0 

0 (2 — a)(a + l)/2_ 


The last matrix is in row echelon form. 

Therefore, if a — 1, 2, then (3, 3) -entry of the last matrix is not zero. From this we see that the 
rank is 3 when a ^ — 1, 2. 

On the other hand, when a — — 1 or a — 2 the third row is a zero row, hence the rank is 2. 


^ For mm X n matrix A, we denote by rref(A) the matrix in reduced row echelon form that is row equivalent to 

. „ ri 1 ^ 

A. For example, consider the matrix A = 

[.0 2 2 . 

Then we have 


A = 


n 

1 

1" 

\R. 

'1 

1 

1" 

R\ —R2 

1 

0 

o' 

1— 

0 

2 

2 

- > 

.0 

1 

1_ 

- > 

.0 

1 

1. 


and the last matrix is in reduced row echelon form. 

"10 0 
0 1 1 


Hencerref(A) = 


. Find an example of matrices A and B such that 
rref(AE?) ^ rref(A)rref(I?). 


Proof. 


Let 


A = 


0 1 
0 0 


and B = 


0 0 
1 0 


Then A is already in reduced row echelon from. 

So we have rref(A) = A. 

Applying the elementary row operations, we obtain 

B = 


'0 

o' 

R\ ^r^Ry, 

'1 

o' 

.1 

0. 

- > 

.0 

0. 


As the last matrix is in reduced row echelon from, we have rref (B) 
Therefore, we see that 

rref(^4)rref(.B) = 


1 0 
0 0 


0 

1' 

'1 

1 

0 


0 

1_ 

o' 

1 

0 

-1 

0 

1- 

0 

1 

0 


0 
_1 

0. 


The product of A and B is 















































1 

o 

o 

1_ 


'l 

1- 

o 


.1 0. 


i 

o 

-1 

o 


It follows that rref(AB) 


1 0 
0 0 


In summary, we have 


rref (AB) = 


'i <r 


o 

o 

1_ 

o 

o 

i_ 

/ 

o 

o 
_1 


rref(A)rref(i?), 


as required. 


(a) Find all 3 X 3 matrices which are in reduced row echelon form and have rank 1. 

(b) Find all such matrices with rank 2. 

Solution. 

(a) Find all 3 X 3 matrices which are in reduced row echelon form and have rank 1. 


First we look at the rank 1 case. 

For a 3 X 3 matrix in reduced row echelon form to have rank 1, it must have 2 rows which are all Os. 
The non-zero row must be the first row, and it must have a leading 1. 

These conditions imply that the matrix must be of one of the following forms: 


lab 


"0 1 c' 


'o o r 

0 0 0 

5 

0 0 0 

> or 

0 0 0 

1 

o 

o 

o 


i 

O 

o 

o 


o 

o 

o 


(b) Find all such matrices with rank 2. 

For a rank 2 3x3 matrix in reduced row echelon form, there must be one row, the bottom one, 
which has only Os. 

Thus we need two leading 1 s in distinct columns, and every other term in the same column with a 
leading 1 must be 0. The possibilities are: 


"1 0 a' 


"1 a O' 


'0 1 O' 

0 1 b 

? 

0 0 1 

, or 

0 0 1 

O 

O 

O 


O 

O 

O 


O 

O 

O 


If A, B , C are three m X n matrices such that A is row-equivalent to B and B is row-equivalent to C, then can 
we conclude that A is row-equivalent to Cl If so, then prove it. If not, then provide a counterexample. 

Definition (Row Equivalent). 



Two matrices are said to be row equivalent if one can be obtained from the other by a sequence of 
elementary row operations. 



























Proof. 

Yes, in this case A and C are row-equivalent. 

By assumption, the matrices A and B are row-equivalent, which means that there is a sequence of 
elementary row operations that turns A into B. 

Call this sequence r\ , 7*2 , • • • , r n , where each n is an elementary row operation. 

(Start with applying v\ to A.) 

By another assumption, B is row-equivalent to C , which means that there is a sequence of 
elementary row operations which transforms B into C ; call this sequence Si , 52, • • •, s m . 

Putting these sequences together, the operations r\ , 7*2 , • • •, r n , S \, 52 , • • •, s m will transform the 
matrix A into C. 

This proves that A and C are row-equivalent. 


Prove that if A is an n X n matrix with rank n, then rref( A) is the identity matrix. 

Proof. 

Because A has rank n , we know that the n x n matrix rref(^4) has n non-zero rows. 

This means that all n rows must have a leading 1. 

Each leading 1 must be in a distinct column, so we must have that each of the n columns has a leading 

1 . 

In a row echelon matrix, these leading Is must be arranged to lie on the diagonal. 

In a reduced row echelon matrix, each column with a leading 1 has Os above and below that 1. 

These restrictions mean that rref(v4) must be the identity matrix. 


0 Recall that a matrix A is symmetric if AA — A, where AA is the transpose of A. Is it true that if A is a 
symmetric matrix and in reduced row echelon form, then A is diagonal? If so, prove it. Otherwise, provide a 
counterexample. 

Proof. 

This is true. 

If A is in reduced row echelon form, then every term below the diagonal must be 0. 

That is, the entry dij — 0 for all i > j. 

If A is, additionally, symmetric, then for i < j we also have dji — dij = 0. 

These two facts together means that d^ — 0 whenever i ^ j. 

In this case, the only non-zero terms in the matrix A lie on the diagonal, and so A is a diagonal 
matrix. 




Gaussian-Jordan Elimination 


Definition 


Consider the m X n system of linear equations: 

di\X\ + ai2#2 + * * ' + 0,1 n x n = &1 

a 21 ^i + d 2 2 x 2 + * • • + CL2n x n = &2 

^31^1 + 0%2 x 2 + * * * + CL3n x n = &3 

O m 2 x 2 H - ’ * ' H - OjyijiXji b m 

The coefficient matrix of the system is 


an 

«12 

’ * din 

021 

P 

• to 

to 

d2n 

_ O m l 

Cim2 

djnn 


The augmented matrix of the system is 


Oil 

012 

din 

bi ' 

021 

022 

d2n 

b 2 

0/7)1 

Om2 

djnn 

bm 


[Gauss-Jordan Elimination] For a given system of linear equations, we can find a solution as follows. 
This procedure is called Gauss-Jordan elimination. 

1. Write the augmented matrix of the system of linear equations. 

2. Use elementaray row operations to reduce the augmented matrix into (reduced) row echelon 
form. 

3. Write the system of linear equation corresponding to the matrix in row echelon form. 

4. Solve the system using back substitution. 


Problems 


Solve the following system by transforming the augmented matrix to reduced echelon form. Indicate the 
elementary row operations you performed. 


Xi + X 2 - X 5 = 1 

x 2 + 2x 3 + x 4 + 3x 5 = 1 
xi - x 3 + x 4 + x 5 = 0 


Solution. 

(a) Solve the following system by Gauss-Jordan elimination 
















The augmented matrix of the system is 


1 

1 

0 

0 

-1 

1 " 

0 

1 

2 

1 

3 

1 

1 

0 

-1 

1 

1 

0 


We apply elementary row operations as follows. 


1 

1 

0 

0 

-1 

1 " 

0 

1 

2 

1 

3 

1 

1 

0 

-1 

1 

1 

0 


" 1 

0 

-2 

-1 

-4 

0 " 

0 

1 

2 

1 

3 

1 

0 

0 

1 

2 

5 

0 


1 1 

0 1 

0 

0 

-1 

3 

1 " 
1 

CM CM 

1 + 

2 

1 



0 -1 

-1 

1 

2 

-1 



R\ +2i?3 

R 2 —2Rs 

" 1 

0 

0 

3 

6 

0 " 

-» 

0 

1 

0 - 

-3 - 

7 

1 


0 

0 

1 

2 

5 

0 


The last matrix is in reduced row echelon form. 

The unknowns X\ , X2, #3 correspond to the leading l’s. Thus they are dependent variables and X4 
and X5 are independent (free) variables. 

From the last matrix, we see that the solution set is given by 

X\ — — 3 x 4 — 6x5 
X2 — 3^4 + 7X5 + 1 
X3 = —2X4 5X5 


for any numbers X4, X5. 

(b) Determine all possibilities for the solution set of a homogeneous system 

A homogeneous system is always consistent because it has the zero solution Xi = 0, X 2 =0. 

So, we already know the possibilities for the solution set are either a unique solution or infinitely 
many solutions. 

Note that the homogeneous system we are inspecting has a solution xi = 1, X 2 = 5, which is 
different from the zero solution xi = 0, X 2 = 0. Thus, the homogeneous system has at least two 
solutions. 

Thus, the only possibility is that the homogeneous system has infinitely many solutions. 


Solve the following system of linear equations using Gaussian elimination. 

x + 2y + 3z = 4 
5x + 6y + 7z — 8 
9x + 1C )y + 11 z = 12 

Elementary row operations 


The three elementary row operations on a matrix are defined as follows. 


(1) Interchanging two rows: 





















Ri Rj interchanges rows i and j. 

(2) Multiplying a row by a non-zero scalar (a number): 

tRi multiplies row i by the non-zero scalar (number) t . 

(3) Adding a multiple of one row to another row: 

Rj + tRi adds t times row i to row j. 


Solution. 

First, the augmented matrix of the system is 



" 1 

2 

3 

4 " 

A = 

5 

6 

7 

8 


9 

10 

11 

12 


We apply elementary row operations as follows to reduce the system to row echelon form. 


i?3 — 9i?i 

A - y 


" 1 

2 

3 

4 " 


" 1 

2 

3 

4 " 

5 

6 

7 

8 

-» 

5 

6 

7 

8 

0 

-8 

-16 

-24 


0 

1 

2 

3 



" 1 2 3 

4 " 

1 n 

12 3 

4 " 

i?2 — 5i?i 



~J R 2 



- » 

0 -4 -8 

-12 

- » 

0 12 

3 


0 12 

3 


0 12 

3 


i?3 —i?2 
-> 


1 2 
0 1 


3 
2 

0 0 0 


The last matrix is in row echelon form. 


The corresponding system of linear equations of it is 

x + 2y + 3z = 4 
y + 2z = 3 

0z = 0 


The last equation 0z = 0 means that z can be any number. 

(More systematically, the variables corresponding to leading 1 ’s in the echelon form matrix are 
dependent variables, and the rests are independent (free) variables.) 

So let us say that t is a value for z, namely z — t. 

Then from the second equation, we have y = — 2t + 3. 

From the first equation, we have 

x — —2 y — 3 z -|- 4 — —2(—2 1 -\- 3) — 3 1 -1-4 — t — 2. 

























Thus the solution set is 


(x, y, z) = (t- 2, -2 1 + 3, t) 


for any t . 

Comment. 

You may want to check whether the answer is correct by substituting this solution to the original 
equations. 

Also, if you further reduce the matrix into reduced row echelon form, the last system becomes simpler 
(and simplest in a sense). 

This procedure, to reduce a matrix until reduced row echelon form, is called the Gauss-Jordan 
elimination. 


Solve the following system of linear equations using Gauss-Jordan elimination. 

6x + Sy + 6z + 3w — —3 
6x — Sy + 6z — 3w = 3 
8 y 6ic = 6 

We use the following notation. 

Elementary row operations. 

The three elementary row operations on a matrix are defined as follows. 

(1) Interchanging two rows: 

Ri O Rj interchanges rows i and j. 

(2) Multiplying a row by a non-zero scalar (a number): 
tRi multiplies row i by the non-zero scalar (number) t . 

(3) Adding a multiple of one row to another row: 

Rj + tRi adds t times row i to row j. 

Solution. 

The augmented matrix of the system is 


’ 6 

8 

6 

3 

-3 " 

6 

-8 

6 

-3 

3 

0 

8 

0 

-6 

6 


We apply elementary row operations as follows to reduce the system to a matrix in reduced row 
echelon form. 









’ 6 

8 

6 

3 

-3 " 

R\ —Rz 
-> 

’ 6 

0 

6 

9 

-9 " 

0 

-16 

0 

-6 

6 

0 

0 

0 

-18 

18 

0 

8 

0 

-6 

6 

i?2 +2i?3 

0 

8 

0 

-6 

6 


3 i?l , lg R 2 

’ 2 

0 

2 

3 

-3 " 

R\ — 3i?2 

’ 2 

0 

2 

0 

0 " 

-» 

0 

0 

0 

1 

-1 

- > 

0 

0 

0 

1 

-1 

7^3 

0 

4 

0 

-3 

3 

Rz +3i?2 

0 

4 

0 

0 

0 


" 1 

0 

1 

0 

0 " 

R2 i-^-Rz 

" 1 

0 

1 

0 

0 " 

0 

0 

0 

1 

-1 

-> 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 


0 

0 

0 

1 

-1 


The last matrix is in reduced row echelon form. 

The corresponding system of linear equations is 

x + z = 0 

y = o 

W — — 1 

Let z — t be a free variable. Then the solution is (x, y, z, u;) = (—t, 0 , i, — 1 ) for any number i. 


Solve the following system of linear equations by transforming its augmented matrix to reduced echelon form 
(Gauss-Jordan elimination). Find the vector form for the general solution. 


X \ — — 3 x 5 — 1 

3x\ + X2 — Xs + X4 — 9^5 — 3 
X \ — #3 + £4 — 2 x $ — 1 . 


Solution. 


The augmented matrix of the given system is 


1 

0 

-1 

0 

-3 

1 " 

3 

1 

-1 

1 

-9 

3 

1 

0 

-1 

1 

-2 

1 


We apply the elementary row operations as follows. 
We have 


" 1 

0 

-1 

0 

-3 

1 " 

R 2 —3R\ 

Rz —R\ 

" 1 

0 

-1 

0 

-3 

1 " 

3 

1 

-1 

1 

-9 

3 

-» 

0 

1 

2 

1 

0 

0 

1 

0 

-1 

1 

-2 

1 


0 

0 

0 

1 

1 

0 


" 1 

0 

-1 

0 

-3 

1 " 

0 

1 

2 

0 

-1 

0 

0 

0 

0 

1 

1 

0 


The last matrix is in reduced row echelon form. 

From this reduction, we see that the general solution is 








































£i — £ 3 + 3x 5 + 1 
x 2 — -2x 3 + x 5 

£4 = —£ 5 . 


Here £ 3 , £5 are free (independent) variables and £ 1 , X 2 , £4 are dependent variables. 

To find the vector form for the general solution, we substitute these equations into the vector x as 
follows. 

We have 


XI 


£3 + 3£5 + 1 

X 2 


- 2£ 3 + £5 

£3 

= 

X 3 

£4 


£5 

_#5_ 


£5 


X3 


3£ 5 


"1' 

- 2 X 2 , 


£5 


0 

X3, 

+ 

0 

+ 

0 

0 


-£5 


0 

0 


_ ^5 _ 


.0. 


■ 1 ' 


' 3 ' 


" 1 " 

-2 


1 


0 

1 

+ X 5 

0 

+ 

0 

0 


-1 


0 

. 0 . 


. 1 . 


.0. 


Therefore the vector form for the general solution is given by 


' 1 ■ 


' 3 ' 


" 1 " 

-2 


1 


0 

1 

+ x 5 

0 

+ 

0 

0 


-1 


0 

. 0 . 


. 1 . 


.0. 


where £ 3 , £5 are free variables. 


The given matrix is the augmented matrix for a system of linear equations. Give the vector form for the general 
solution. 


’ 1 

0 

-1 

0 

-2 

0 " 

0 

1 

2 

0 

-1 

0 

1- 

0 

0 

0 

1 

1 

0 

1_ 


Solution. 

(a) Give the vector form for the general solution 



























The system of linear equation represented by the augmented matrix is 

X\ — X 3 — 2^5 = 0 

X 2 + 2x$ ~ X 5 = 0 
X4 + £5 = 0. 


Solving the preceding system, we obtain 

x\ = x 3 + 2x 5 

X 2 = - 2 X 3 + X 5 
X4 — — X5. 


Here X 2 , x$ are free variables and the rest are dependent variables. 
Hence the general solution x can be expressed as 



~Xi 


X 3 + 2^5 


X2 


-2x 3 + X 5 

X = 

X3 

— 

X3 


X 4 


— x 5 


_ x§ _ 


£5 


Xs 


" 2x 5 " 


■ 1 " 


' 2 ' 

-2x 3 


X5 


-2 


1 

X3 

+ 

0 

= x 3 

1 

+ x$ 

0 

0 


— X5 


0 


-1 

0 . 


_ ^5 _ 


. 0 . 


. 1 . 


Therefore, the vector form for the general solution is 


‘ 1 ' 


' 2 ' 

-2 


1 

1 

+ x § 

0 

0 


-1 

. 0 . 


. 1 . 


(b) Matrix product and transpose 

First of all, note that by the property of the transpose we have 

(A - B) t = A t - B t . 

Hence we can simply the middle part: 

A T - (A - B) t =A t - ( A t - B t ) 

= A t -A t + b t = b t . 


Thus the expression becomes 


v T ( A t - {A- B ) t ) C = v t B t C. 


















The transposes of v and B are 


v T = [ 0 1 0 ] and B T 


1 0 
0 1 
1 0 


Thus we have 


v T (A t - (A - B) t ) C = v t B t C 


= [0 

= [0 


1 0] 


1 
0 
1 

1 2 
0 6 


0 

1 

0 


1 2 
0 6 


= [0 6 ]. 


In conclusion, we have obtained 

v T (^4 t — (A — -B) T ) C = [0 6]. 


Solve the following system of linear equations and give the vector form for the general solution. 

X \ — — 2^5 — 1 

x 2 + 3^3 - x 5 = 2 
2 x 1 — 2x 3 + £4 — 3^5 — 0 


(The Ohio State University) 

Solution. 

We solve the system by Gauss-Jordan elimination. 
The augmented matrix of the system is given by 


1 

0 

-1 

0 

-2 

1 " 

0 

1 

3 

0 

-1 

2 

1- 

to 

0 

-2 

1 

-3 

0 


We apply the elementary row operations as follows. 


" 1 

0 

-1 

0 

-2 

1 " 

i?3 — 2 R\ 
- > 

" 1 

0 

-1 

0 

-2 

1 " 

O 

1 

3 

0 

-1 

2 

0 

1 

3 

0 

-1 

2 

2 

0 

-2 

1 

-3 

0 


0 

0 

0 

1 

1 

-2 


Then the last matrix is in reduced row echelon form. 

The variables X \, x 2 , #4 correspond to the leading l’s of the last matrix, hence they are dependent 
variables and the rest # 3 , £5 are free variables. 

From the last matrix we obtain the general solution 

Xi = Xz + 2a?5 + 1 
X 2 = - 3 X 3 + *5 + 2 

X4 = —X5 — 2. 




















The vector form for the general solution is obtained by substituting these into the vector x. 
We have 


Xi 


X 3 + 2x 5 + 1 

X 2 


—3^3 + X 5 + 2 

xs 

— 

Xs 

X4 


-x 5 - 2 

_ x§ _ 


£5 


■ 1' 


' 2 ' 


‘ 1 ' 

-3 


1 


2 

1 

+ x§ 

0 

+ 

0 

0 


-1 


-2 

. 0. 


. 1 . 


. 0 . 


Therefore, the vector form for the general solution is given by 


" 1 ' 


' 2 ' 


' 1 " 

-3 


1 


2 

1 

+ x 5 

0 

+ 

0 

0 


-1 


-2 

. 0 . 


. 1 . 


. 0 . 


where £3, X5 are free variables. 


Determine whether the following augmented matrices are in reduced row echelon form, 
sets of their associated systems of linear equations. 



" 1 

0 

0 


2 

(a) 

0 

1 

0 


-3 


0 

0 

1 


6 

(b) 

1 

0 

0 

3 


-4 

0 

1 

2 



1 

2 

0 


(c) 






1 

1 

-1 



Solution. 



’ 1 

0 

0 

2 " 

(a) 

0 

1 

0 

-3 


0 

0 

1 

6 


The first matrix is in reduced row echelon form. 


and calculate the solution 


For this matrix, we can read off the solution X\ — 2 , X2 — — 3, CC 3 = 6 . 





























The second matrix is in reduced row echelon form. 

For this matrix, the variable £3 is a free variable because there are no leading Is in the 3rd column. 
The solution set can be expressed as X\ — —4— 3^3 , X 2 — — 2#3 , and #3 can be any real number. 





(C) 

1 2 

0 

1 1 

-1 





The third matrix is not in reduced echelon form because the bottom-left entry is 1, not 0, so we first 
use elementary row operations to put it in this form. 


1 

2 

0 

R2 — R1 
-» 

1 

2 

0 

1 

1 

-1 


0 

-1 

-1 


(-i)Jfe 

1 

2 

0 " 

R\ -2R 2 

1 

0 

-2 " 

-> 

0 

1 

1 

-> 

0 

1 

1 


With the reduced matrix, we can read off the solution X\ = — 2 and #2 = 1. 


0; Find a cubic polynomial p(x) = a -\-bx + cx 2 + dx 3 such that p(l) = 1,//(1) = 5,p( — 1) = 3, and 

p'(- 1 ) = 1 . 


Solution. 

By differentiating p(x), we obtain 


p\x ) = b + 2cx + 3dx 2 - 

Thus the given conditions are 

p(l) —a + b + c + d— 1 
p \ 1 ) = 6 + 2 c + 3 d = 5 
p(— 1) = a — b + c — d = 3 
p'(-l) = 6 — 2 c + 3 d = 1 . 

We want to solve these four equations for a, b, c, d. So a, b, c, d are unknowns and we regard the 
conditions as a system of four linear equations. 

We solve the system using Gauss-Jordan elimination. 

The augmented matrix for the system is 


‘ 1 111 

1 " 

0 12 3 

5 

1-1 1-1 

3 

0 1-23 

1 


We apply the elementary row operations as follows. 
























’ 1 

1 

1 

1 

1 ' 


' 1 

1 

1 

1 

1 ' 

0 

1 

2 

3 

5 

0 ? 

1 

CO 

0 

1 

2 

3 

5 

1 

-1 

1 

-1 

3 

0 

-2 

0 

-2 

2 

0 

1 

-2 

3 

1 


0 

1 

-2 

3 

1 


Ri —R2 
Rs + 2 R 2 

R 4 — R 2 


' 1 

0 

-1 

-2 

-4 ' 


’ 1 

0 

-1 

-2 

-4 ' 

0 

1 

2 

3 

5 

1*3 
-> 

0 

1 

2 

3 

5 

0 

0 

4 

4 

12 

0 

0 

1 

1 

3 

_ 0 

0 

-4 

0 

-4 


_ 0 

0 

1 

0 

1 


R\ -\-R 4 

R 2 — 2i?4 
Rs —i?4 


1 

0 

0 

-2 

-3 ' 


~ 1 

0 

0 

-2 

-3 ' 

0 

1 

0 

3 

3 

Rs 

V 

0 

1 

0 

3 

3 

0 

0 

0 

1 

2 

- 7 

0 

0 

1 

0 

1 

0 

0 

1 

0 

1 


0 

0 

0 

1 

2 


R\ -\-2R4. 

R 2 — 3i? 4 


' 1 

0 

0 

0 

1 ' 

0 

1 

0 

0 

1 

OJ 

0 

0 

1 

0 

1 

0 

0 

0 

1 

2 


The last matrix is in reduced row echelon form. Thus the solution to the system is 

a = 1,6 = —3, c — l,rf = 2. 


Therefore, the polynomial is 


p(x) = 1 — 3x + # 2 + 2x 3 . 


Find a quadratic function /(#) = ax 2 + bx + c such that /(l) = 3, /'(l) = 3, and /"(1) = 2. Here, /'(#) 
and f"(x) denote the first and second derivatives, respectively. 

Solution. 

Each condition required on f can be turned into an equation involving the constants a, 6 , c. 

In particular, /(1) = 3 tells us that a + b + c — 3 . 

Because f'{x ) = 2 ax + 6, the condition 1) = 3 gives us 2a + b = 3 . 

And finally f n {x ) = 2a, and so /^(l) = 2a — 2. Thus we have the system of equations 

a + 6 + c = 3 
2a + 6 = 3 
2 a = 2 


To solve this system, we could create the augmented matrix and then reduce it. 

For this system, though, it is simpler to solve directly. The third equation tells us that a — 1. 
Plugging this value into the second equation, we find 6=1. 

Plugging both of these values into the first equation, we see c = 1 as well. 





























Thus the function we want is f(x) 


x 2 + x + 1. 


(a) Find a function g(0) — acos(0) + 6cos(20) ■+ ccos(30) such that ^(0) = g(ir/2) — g(ir) — 0, where 
a, 6, c are constants. 

(b) Find real numbers a, 6, c such that the function g(0) — a cos (9) + b cos (28) + c cos(3 8) satisfies 
g( 0) = 3,flf(7r/2) = l,andflf(7r) -5. 


Solution. 


(a) Condition: g(0) = g( 7t/ 2) = g(7r) = 0 


Each condition required on g can be turned into an equation involving the constants a, 6 , c. In 
particular, we have the system of linear equations 


^(0) =a+6+c=0 



g( 7 r) = —a + b-c — 0. 


To solve this system, we will use Gauss-Jordan elimination to reduce its augmented matrix. 


1 

1 

1 

0 " 

R\ +i?2 

1 

0 

1 

0 " 

(-l)ft 

' 1 

0 

1 

0 " 

0 

-1 

0 

0 

- » 

0 

-1 

0 

0 

- > 

0 

1 

0 

0 

-1 

1 

-1 

0 

i?3+i?2 

-1 

0 

-1 

0 

Rs -\-Ri 

0 

0 

0 

0 


The solution can now be read off: a + c — 0 and 6 = 0. 

Thus a general solution is of the form g(9) = a cos(9)— a cos (30), where a is any real number. 

(b) Condition: ^(0) = 3,p(7r/2) = l,andg(7r) = —5 

Each condition required on g can be turned into an equation involving the constants a, 6 , c. In 
particular, we have 

g( 0) = a + 6 + c = 3 
g{n/2) = -6=1 

g( 7 r) = —a + b-c = — 5 . 


To solve this system, we will use Gauss-Jordan elimination to reduce its augmented matrix. 


1 

1 

1 

3 " 

Rl +i ?2 

1 

0 

1 

4 " 

(- 1)«2 

" 1 

0 

1 

4 ' 

0 

-1 

0 

1 

- > 

0 

-1 

0 

1 

-> 

0 

1 

0 

-1 

-1 

1 

-1 

-5 

R 3 +R 2 

-1 

0 

-1 

-4 

i?3 +i?i 

0 

0 

0 

0 


The solution can now be read off: a + c — 4 and 6 = — 1. Thus a general solution is of the form 

g{9) = acos(0)-cos(20) + (4-a)cos(30), 
























where a is any real number. 


A 2-digit number has two properties: The digits sum to 11, and if the number is written with digits reversed, and 
subtracted from the original number, the result is 45. Find the number. 

Solution. 

The key to this problem is noticing that our 2-digit number can be written as 10^4 + B , where A, B 
are the 10s and Is digit respectively. 

The digits summing to 11 then yields the equality A + B — 11. 

The number with digits reversed is 10 B + A, and so the second property yields the equation 
10 A + B— (10 B + A) = 45. Simplifying, we have the system of equations 

A + B= 11 
9A - 9B = 45 


To solve this system, we create the augmented matrix and use elementary row operations to put it into 
reduced row-echelon form: 


1 

1 

11 

f?2 —9i?i 
- > 

1 

1 

11 

9 

-9 

45 

0 

-18 

-54 


18 

1 

1 

11 

S 3 

Sa 

to 

1 

0 

8 

-> 

0 

1 

3 

-> 

0 

i 

3 


With this final augmented matrix, we can read off the solution A — 8 and B — 3. 


Thus our number is 10 A + B — 83. 


















Consistent and Inconsistent Systems of Linear Equations 


Definition 


Q An m X n system of linear equations is 

a n xi + a u x 2 H-h a\ n x n = &i (*) 

& 21#1 + & 22#2 + * * * + CL2n%n = &2 
& 31#1 + & 32#2 + * * * + Ci3n%n = &3 

&m\X\ + CLm2X2 + * * * + Q'mnX n — 

where x \, # 2 ,. . ., are unknowns (variables) and a^, bk are numbers. 

Thus an m X n system of linear equations consists of m equations and n unknowns x \, x 2 ,..., x n . 

A system of linear equations is called homogeneous if the constants b \, b 2 ,. . ., b m are all zero. 

A solution of the system (*) is a sequence of numbers Si, 52,. .., s n such that the substitution 
X\ — s i, #2 = 52 ,..., x n = s n satisfies all the m equations in the system (*). 

The rank of a system of linear equation is the rank of the coefficient matrix. 


Summary 


For a given system of linear equations, there are only three possibilities for the solution set of the 
system: No solution (inconsistent), a unique solution, or infinitely many solutions. 

The possibilities for the solution set of a homogeneous system is either a unique solution or infinitely 
many solutions. 

If m < Tl, then an m X n system is either inconsistent or it has infinitely many solutions. 

If m < n, then an m X n homogeneous system has infinitely many solutions. 

If a consistent m X n system of linear equation has rank r, then the system has n — r free variables. 


Problems 

Determine all possibilities for the solution set of the system of linear equations described below. 

(a) A homogeneous system of 3 equations in 5 unknowns. 

(b) A homogeneous system of 5 equations in 4 unknowns. 

(c) A system of 5 equations in 4 unknowns. 

(d) A system of 2 equations in 3 unknowns that has X\ — 1, #2 = — 5, CC 3 = 0asa solution. 

(e) A homogeneous system of 4 equations in 4 unknowns. 

(f) A homogeneous system of 3 equations in 4 unknowns. 

(g) A homogeneous system that has x \ — 3 , £2 — —2, CC3 = 1 as a solution. 

(h) A homogeneous system of 5 equations in 3 unknowns and the rank of the system is 3. 
















(i) A system of 3 equations in 2 unknowns and the rank of the system is 2. 

(j) A homogeneous system of 4 equations in 3 unknowns and the rank of the system is 2. 

the possibilities for the solution set of a system of linear equations 

An m x n system of linear equations is 

a ll x l + a l2 x 2 +-+a ln x n 
a 2l x l +a 22 x 2 + -+a 2n x n 

a 31*l + a 32*2 + ”• + a 3»*» 

a ml x l + a m2 X 2 + - + a mn x n = b m> 

where Xy, x 2 , •. x n are unknowns (variables) and a t p b k are numbers. 

Thus an m * n system of linear equations consists of m equations and n unknowns Xy, x 2 , .. x n . 
A system of linear equations is called homogeneous if the constants by, b 2 , .. b m are all zero. 
Namely, a homogeneous system is 


= b \ 
= b 2 



a u x l +a l2 x 2 + - + a ln x n = 0 
a 2l x l +a 22 x 2 + -+a 2rr x n = 0 
a 31 x 1 +a 32 x 2 + -+ a 3rt Xn = 0 

a ml x l +a m2 x 2 + - + a mn x n = 0 - 

A solution of the system (*) is a sequence of numbers sq, s 2 , .. s n such that the substitution 
Xy = Sy,x 2 = s 2 , . • x n = s n satisfies all the m equations in the system (*). 

We sometimes use the vector notation and say 



is a solution of the system. 

For example, every homogeneous system has the zero solution xy = 0, x 2 = 0, ..., x n = 0, or 


■ 0 1 

0 


x 


0 


Here we summarize several theorems concerning with the possibilities for the number of solutions of 
a system of linear equations. 

We say a system is consistent if the system has at least one solution. 

A system is called inconsistent if the system has no solutions at all. 








Theorem 1. For a given system of linear equations, there are three possibilities for the solution 
set of the system: No solution (inconsistent), a unique solution, or infinitely many solutions. 


Thus, for example, if we find two distinct solutions for a system, then it follows from the theorem that 
there are infinitely many solutions for the system. 

Next, since a homogeneous system has the zero solution, it is always consistent. Thus: 


Theorem 2. The possibilities for the solution set of a homogeneous system is either a unique 
solution or infinitely many solutions. 


Let us refine these theorems. Suppose that an m x n system of linear equations is given. That is, there 
are m linear equations and n unknowns. 


Theorem 3. If m < n, then the system is either inconsistent or it has infinitely many solutions. 


Thus, there are only two possibilities when m < n: No solution or infinitely many solutions. 


Theorem 4. Consider m x n homogeneous system of linear equations. Then the system has 
always infinitely many solutions. 


This is obtained by noting that a homogeneous system always has the zero solution, hence consistent. 
By the previous theorem, the only possibility is infinitely many solutions. 

Summary 1 


Summary 1: The number of solutions of a system of linear equations is one of 0, 1, or oo. If there 
are more unknowns (n) than the number of equations ( m ), then the number of solutions of the 
system is either 0 or go. If a system is homogeneous, then it has the zero solution and thus a 
homogeneous system is always consistent. 

The case m>nt 

What happens when m>nl 

In general, when the number of equations is greater than or equal to the number of unknowns, we 
cannot narrow down the possibilities. 

We need more information about the system. The key word is the rank of the system. 

For a given system (*), let A be the coefficient matrix and let b be the constant term vector. Then we 
define the rank of the system to be the rank of the augmented matrix [A | b]. 

Recall that the rank is defined as follows. We first reduce the matrix [A | b] to a matrix [A | b ] in 


(reduced) row echelon form by elementary row operations. 

Then the rank of [A | b] is the number of nonzero rows in the matrix [A \ b ]. 


Theorem 5. Consider an^x/j system of linear equations. Suppose that it is consistent. Then the 
rank r of the system satisfies r < n. Also, the system has n - r free variables. 


A free variable means an unknown that can be assigned arbitrary values. It follows that if a system has 
a free variable, then there are infinitely many solutions. 

Caution: the theorem assumes that a given system is consistent. 

Summary 2 


Summary 2: Suppose that an/wx/i system of linear equations is consistent and let r be the rank 
of the system. If n = r, then the system has a unique solution. If n > r, then the system has 
infinitely many solutions. 


Problems and solutions 

Determine all possibilities for the solution set of the system of linear equations described below. 

(a) A homogeneous system of 3 equations in 5 unknowns. 

(b) A homogeneous system of 5 equations in 4 unknowns. 

(c) A system of 5 equations in 4 unknowns. 

(d) A system of 2 equations in 3 unknowns that has x l = 1, x 2 = ~ 5, x 3 = 0 as a solution. 

(e) A homogeneous system of 4 equations in 4 unknowns. 

(f) A homogeneous system of 3 equations in 4 unknowns. 

(g) A homogeneous system that has x 1 = 3, x 2 = ~ 2, x 3 = 1 as a solution. 

(h) A homogeneous system of 5 equations in 3 unknowns and the rank of the system is 3. 

(i) A system of 3 equations in 2 unknowns and the rank of the system is 2. 

(j) A homogeneous system of 4 equations in 3 unknowns and the rank of the system is 2. 

Solutions. 

In the solution, m denotes the number of equations and n denotes the number of unknowns in the 
given system. 

(a) A homogeneous system of 3 equations in 5 unknowns. 

Since the system is homogeneous, it has the zero solution, hence consistent. Since there are more 
unknowns than equations, there are infinitely many solutions. 



(b) A homogeneous system of 5 equations in 4 unknowns. 

Since the system is homogeneous, it has the zero solution. Since there are more equations than 
unknowns, we cannot determine further. 

Thus the possibilities are either a unique solution or infinitely many solution. 

(If the rank r of the system is 4, then a unique solution. If r < 4, then there are infinitely many 
solutions.) 

(c) A system of 5 equations in 4 unknowns. 

Since m > n, we can only say that the possibilities are no solution, a unique solution, or infinitely 
many solution. 


(d) A system of 2 equations in 3 unknowns that has xjr 1, x 2 = ~ 5, x 3 = 0 as a solution. 

Since m < n, the system is either inconsistent or has infinitely many solutions. Since 
X ] = 1, x 2 = - 5, x 3 = 0 is a solution of the system, the system is not inconsistent. Thus the only 
possibility is infinitely many solutions. 

(e) A homogeneous system of 4 equations in 4 unknowns. 

Since m = n, this tells nothing. But since the system is homogeneous it has the zero solution, hence 
consistent. The possibilities are either a unique solution or infinitely many solutions. 


(f) A homogeneous system of 3 equations in 4 unknowns. 

Since m < n, the system has no solution or infinitely many solutions. But a homogeneous system is 
always consistent. Thus, the only possibility is infinitely many solutions. 

(g) A homogeneous system that has x 1 = 3, x 2 = - 2, x 3 = 1 as a solution. 

The possibilities for the solution set for any homogeneous system is either a unique solution or 
infinitely many solutions. Since the homogeneous system has the zero solution and 
Xi = 3, x 2 = - 2, x 3 = 1 is another solution, it has at least two distinct solution. Thus the only 
possibility is infinitely many solutions. 


(h) A homogeneous system of 5 equations in 3 unknowns and the rank of the system is 3. 

A homogeneous system is always consistent. Since the rank r of the system and the number n of 
unknowns are equal, the only possibility is the zero solution (and the zero solution is a unique 
solution). 


(i) A system of 3 equations in 2 unknowns and the rank of the system is 2. 

We don’t know whether the system is consistent or not. 

If it is consistent, then since the rank r and the number of unknowns are the same, the system has a 
unique solution. Thus the possibilities are either inconsistent or a unique solution. 

Before talking about the rank, we need to discuss whether the system is inconsistent or not. For 
example, consider the following 3x2 system 

x \ + x 2 = 1 
2x 1 + 2x 2 = 3 
3x 1 + 3x 2 = 3. 

Then the augmented matrix is 

'1 1 1 ' 

2 2 3. 

3 3 3 

We reduce this by elementary row operations as follows. 


■1 

1 

1 ' 

7 v 2 y 

*3-3*1 

■1 

1 

1 ■ 

2 

2 

3 


0 

0 

1 

3 

3 

3 


0 

0 

0 


The last matrix is in echelon form and it has two nonzero rows. Thus, the rank of the system is 2. 
However, the second row means that we have 0=1. Hence the system is inconsistent. 


(j) A homogeneous system of 4 equations in 3 unknowns and the rank of the system is 2. 

A homogeneous system is consistent. The rank is r = 2 and the number of variables is n = 3. Hence 
there is n — r= 1 free variable. Thus there are infinitely many solutions. 


Determine all possibilities for the number of solutions of each of the system of linear equations described below. 

(a) A system of 5 equations in 3 unknowns and it has X\ — 0, £2 = — 3, £3 = 1 as a solution. 

(b) A homogeneous system of 5 equations in 4 unknowns and the rank of the system is 4. 

(The Ohio State University) 


Hint. 

See the post Summary: possibilities for the solution set of a system of linear equations to review how to 
determine the number of solutions of a system of linear equations. 








Solution. 

Let m be the number of equations and n be the number of unknowns in the given system. 

Note that the information m > n tells us nothing about the possibilities for the number of solutions. 

(a) A system of 5 equations in 3 unknowns and it has x i = 0, x 2 = — 3, x 3 = lasa solution. 

The system has at least one solution X\ — 0,X2 = — 3,X3 = 1, hence it is consistent. 

Thus, the system has either a unique solution or infinitely many solutions. Since m > n , we cannot 
narrow down the possibilities further. Thus, the possibilities are either a unique solution (which must be 
X\ — 0, x2 = —3,X3 = 1 ) or infinitely many solution. 

(b) A homogeneous system of 5 equations in 4 unknowns and the rank of the system is 4. 

Since the system is homogeneous, it has always the zero solution, hence it is consistent. 

The fact m > n gives no new information. But since the rank r of the system is equal to the number of 
unknowns n , there is no free variable, (n — r — 0 .) Thus, the system must have a unique solution, which 
is the zero solution. 

Comment. 

This is one of the midterm exam 1 problems of linear algebra (Math 2568) at the Ohio State University. 

Some students wrongly concluded from the condition m > n . Again, note that the condition m > n 
gives no new information at all. 

Determine all possibilities for the solution set of a homogeneous system of 2 equations in 2 unknowns that has a 
solution X\ = l,x 2 = 5 . 

(b) Determine all possibilities for the solution set of a homogeneous system 

A homogeneous system is always consistent because it has the zero solution x\ — 0, X 2 — 0. 

So, we already know the possibilities for the solution set are either a unique solution or infinitely 
many solutions. 

Note that the homogeneous system we are inspecting has a solution X\ — 1, #2 — 5, which is 
different from the zero solution X\ — 0, X 2 — 0. Thus, the homogeneous system has at least two 
solutions. 

Thus, the only possibility is that the homogeneous system has infinitely many solutions. 


For what value(s) of a does the system have nontrivial solutions? 

Xi + 2 x 2 +£3=0 
- Xi - x 2 + x 3 = 0 

3 xi + 4^2 + ax 3 — 0. 

Solution. 

First note that the system is homogeneous and hence it is consistent. Thus if the system has a 
nontrivial solution, then it has infinitely many solutions. 


This happens if and only if the system has at least one free variable. The number of free variables is 
n — r, where n is the number of unknowns and r is the rank of the augmented matrix. 


To find the rank, we reduce the augmented matrix by elementary row operations. 


1 

2 

1 

0 " 

~\~Rl 

" i 

2 

1 

0 " 

-1 

-1 

1 

0 

- > 

0 

1 

2 

0 

3 

4 

a 

0 

Rs — 3.Ri 

0 

-2 

a — 3 

0 


Rs +2i?2 
- > 


" 1 

2 

1 

o " 

0 

1 

2 

0 

0 

0 

CL -hi 

0 


The last matrix is in row echelon form. 

Thus if a + 1 = 0, then the third row is a zero row, hence the rank is 2. In this case we have 
n — r — 3 — 2 = 1 free variable. Thus there are infinitely many solutions. In particular, the system 
has nontrivial solutions. 


On the other hand, if a + 1 0, then the rank is 3 and there is no free variables since 

n — r — 3 — 3 = 0. 

In summary, the system has nontrivial solutions exactly when a — — 1. 


Suppose that the following matrix A is the augmented matrix for a system of linear equations. 


1 

2 

3 

4 " 

2 

-1 

-2 

a 2 

-1 

-7 

-11 

a 


where a is a real number. Determine all the values of a so that the corresponding system is consistent. 


Solution. 


We apply the elementary row operations to A as follows. 



1 

2 

3 

4 " 

i?2 —2Ri 

Rs ~\~Ri 

" i 

2 

3 

4 ' 

A = 

2 

-1 

-2 

a 2 

- > 

0 

-5 

-8 

a 2 — 8 


-1 

-7 

-11 

a 


0 

-5 

-8 

a + 4 


1 

2 

3 

4 " 

0 

-5 

-8 

a 2 — 8 

_ 0 

0 

0 

— (x 2 a 12 


The last matrix is in row echelon form. (It’s not in reduced row echelon form.) 

Then the system is consistent if and only if —a 2 + a + 12 is zero. (See Remark below.) 
Since we can factor 



























0 — — ci/ a 12 — — (cl + 3) (a — 4), 

we see that the system is consistent if and only if a = —3, 4. 

Remark 

If—a 2 + a + 12 0, then we divide the third row by —a 2 + a + 12 and obtain 

0 0 0 | 1 

in the third row. 

The corresponding linear equation is 


Oxi + 0x 2 + 0x 3 = 1, 

and clearly, there is no solution to this equation. Hence the system is inconsistent. 

On the other hand, if—a 2 + a + 12 = 0, then the system has solutions. (You just need to reduce 
the above matrix further or use the back substitution.) 


Determine whether the following systems of equations (or matrix equations) described below has no solution, one 
unique solution or infinitely many solutions and justify your answer. 

(a) 

( ax + by — c 
\dx + ey = /, 

where a, 6, c, d are scalars satisfying a/d — b/e — c/ f. 

(b) Ax = 0, where A is a non-singular matrix. 

(c) A homogeneous system of 3 equations in 4 unknowns. 

(d) Ax = b, where the row-reduced echelon form of the augmented matrix [A|b] looks as follows: 

"1 0 -1 0 " 

0 12 0 . 

0 0 0 1 


(The Ohio State University) 

Hint. 

Recall that possibilities for the solution set of a system of linear equation is either no solution 
(inconsistent) or one unique solution or infinitely many solutions. 

A homogeneous system is a system with zero constant terms. 

A homogeneous system always has the zero solution. 

Solution. 

(a) Note that by the given condition a/d = b/e = c/ f, the numbers 6, e, c are not zero. The 
augmented matrix of the system is 


a b 

c 

d e 

-1 


We apply the elementary row operations as follows. 









a 

d 


b 

c 

R l -~ d R 2 

e 

/. 



1 

e/d 

f/d ' 

0 

0 

0 


0 0 0 
d e f 


R\ 4~}R2 

d e 

/ 

-» 

0 0 

0 


Here in the first step, we used the relation a/d — b/e — c/ f. 

The last matrix is in reduced row echelon form with rank 1 and it does not have a row of the form 
[00 1 1] . Therefore the system is consistent and there are 

1(= the number of unknowns — rank) free variable, hence there are infinitely many 
solutions. 


(b) The system is homogeneous, thus it has the zero solution. Since the coefficient matrix A is 
singular, the system has non-zero solution as well. 

Therefore, the only possibility is that the system has infinitely many solutions. 


(c) A homogeneous system has the zero solution hence it is consistent. Since there are more 
unknowns than equations, the system must have infinitely many solutions. 


(d) The last row of the reduced row echelon form matrix of the augmented matrix is [00011]. 
It corresponds to the equation 


Oxi + 0^2 + 0x3 = 1. 

Equivalently, this is 0 = 1 and this is impossible. Thus the system has no solution (an inconsistent 
system). 
















Linear Combination and Linear Independence 


Definition 


Q The expression c\ Vi + C 2 V 2 + • • • + C&V& is called a linear combination of vectors 
vi,v 2 ,.. . ,Vjfe G R n , where Ci, C 2 ,.. ., Ck are scalars in R. 

A set of vectors {vi, V 2 ,. . ., v&} is said to be linearly independent if the only scalrs 

Cl, C 2 ,..., Ck satisfying Ci Vi + C 2 V 2 + • • • + C&V& = 0 are ci = C 2 = • • • = c& = 0. We 

also say that the vectors Vi, V 2 ,. . ., v& are linearly independent. 

If vectors are not linearly independent, they are linearly dependent. 


Summary 


The set of n-dimensional vectors {vi, V 2 ,. . ., v&} are linearly dependent if k > n. (If there are 
more vectors than the dimension, then the vectors are linearly dependent.) 


Problems 


O 

Express the vector b 


2 

13 

6 


as a linear combination of the vectors 



(The Ohio State University, Linear Algebra Exam) 


Solution. 

We need to find numbers x i , x 2 , x 3 satisfying 

X1V1 +x 2 v 2 +X3V3 = b. 

This vector equation is equivalent to the following matrix equation. 

[viv 2 v 3 ]x = b 

or more explicitly we can write it as 


■ 1 11' 


Xl 


' 2 ' 

5 2 4 


X 2 

= 

13 

.-1 1 3 


_* 3 _ 


. 6 . 
































Thus the problem is to find the solution of this matrix equation. 

Let us consider the augmented matrix for this system to apply Gauss-Jordan elimination. 
The augmented matrix is 


1 

1 

1 

2 " 

5 

2 

4 

13 

-1 

1 

3 

6 


We apply elementary row operations and obtain a matrix in reduced row echelon form as follows. 


111 

2 " 

*2-512l 
-» 

" 1 

1 1 

5 

2 4 

13 

0 

-3 -1 

-1 

1 3 

6 

*3+*l 

0 

2 4 


1 , 


111 

2 


7*3 




*2^*3 



0 3 1 

-3 

- » 

-R 2 

0 12 

4 


*1 —*2 

" 1 

0 -1 


-2 " 


- > 

0 

1 2 


4 

- » 

*3—3*2 

0 

0 -5 


-15 



Ill 
0 12 
0 3 1 

10-1 

0 1 2 

0 0 1 


2 

4 

-3 

-2 

4 

3 


*l+*3 

" 1 

0 

0 

1 " 

-> 

0 

1 

0 

-2 

*2—2*3 

0 

0 

1 

3 


Therefore the solution for the system is 

x\ — 1,X2 — —2, andx3 = 3 

and we obtain the linear combination 

b = vi — 2 v2 + 3v3. 


o 

Write the vector 


" 1 " 

3 

as a linear combination of the vectors 

" 1 ' 

0 


" 2 " 

-2 


"2" 

0 

-1_ 


. 0 . 


_ 1 _ 


_4_ 


Solution. 


We want to find real numbers x \, X 2 , £3 which solve the equation 


" 1" 


"1' 


■ 2' 


'2' 


£1 + 2 x 2 + 2X3 

3 

_-i_ 

= xi 

0 

.0. 

+ £2 

-2 

. 1 _ 

+ aj 3 

0 

.4. 

— 

-2X2 
x 2 + 4 x 3 


Each row in this matrix now gives an equality, and so the three rows define a system of linear 
equations. To find a solution, we reduce its augmented matrix: 


















































1 

2 

2 

1 " 

Ri -2R 3 
-» 

" 1 

0 

-6 

3 " 

R2 V^i?3 
-> 

" 1 

0 

-6 

3 " 

0 

-2 

0 

3 

0 

0 

8 

1 

0 

1 

4 

-1 

0 

1 

4 

-1 

R2 +2i?3 

0 

1 

4 

-1 


0 

0 

8 

1 



" 1 

0 

-6 

3 " 


1 

0 

0 

15 

4 

-> 

0 

1 

4 

-1 

R\ +6-R3 
-> 

0 

1 

0 

^3 

2 


0 

0 



R2 -4i?3 






1 

1 

8 

0 

0 

1 

1 

8 


15 3 1 

Now we can read off the solution X\ — -j- , x 2 = and X 3 = ^. 


O 


For what value(s) of a is the following set S linearly dependent? 



Solution. 

(a) Linearly dependent vectors 

Since the set S consists of five 4-dimensional vectors, it is linearly dependent regardless of the value of a. 
Thus, for any value of a the set S is linearly dependent. 

(b) Perpendicular nonzero vectors are linearly independent 
Suppose that we have the linear combination 

C 1 V 1 + c 2 v 2 + C 3 V 3 = 0 (*) 

for some scalars C \, c 2 , C 3 . 

To show that the set is linearly independent, we must show that C\ — c 2 = C 3 = 0 . 

Taking the dot product with v 1 , we have 

Vi • (ciVi + c 2 v 2 + C 3 V 3 ) = Vi • 0 = 0. 


Distributing v 1 , we have 

C 1 V 1 • vi + c 2 vi • v 2 + c 3 vi • v 3 = 0 . 
Since the dot products vi • v 2 = 0,vi • V 3 = 0,we have 

II ||2 n 

Ci||Vi|| = C 1 V 1 • Vi = 0. 

Since the vector v 1 7 ^ 0, the length 11 v| | 7 ^ 0 . 

Thus, we must have C\ — 0 . 

































We repeat the above argument with v i replaced by v 2 as follows. 
Since C\ — 0, the (*) becomes 


C2V2 + C3V3 = 0. 


We have 


0 = V2 • 0 = V2 • (c 2 V 2 + C3V3) 
= C 2 V 2 • V 2 + C3V2 • V 3 . 


Since V 2 * V 3 = 0, we have 

n ll 112 

0 = C 2 V 2 • V 2 = C 2 11 v 2 II • 
Since x 7 ^ 0 , we have 1 1 v 211 7^ 0 , and thus c 2 — 0 . 


Hence (*) is now just C3 V3 = 0, and since X3 7 ^ 0, we conclude that C3 = 0. Therefore, we have 
shown that C\ — c 2 — C3 = 0, and thus the set {vi, V2 , V3} is linearly independent. 


Prove that any set of vectors which contains the zero vector is linearly dependent. 

Solution. 

Let 0 be the zero vector, and Vi, • • •, are the other vectors in the set. 

Then we have the non-trivial linear combination 

1*0 + Ovi + 0v2 + • • • + Ovfc = 0. 

This is a non-trivial linear combination because one of the coefficients is non-zero. 
Thus by definition, the set {0, Vi, • • •, v^} is linearly dependent. 


Let { vi, V2, V3 } be a set of nonzero vectors in M m such that the dot product mathbfvi • v j = 0 when i 7^ j. 
Prove that the set is linearly independent. 


Solution. 

(a) Linearly dependent vectors 

Since the set S consists of five 4-dimensional vectors, it is linearly dependent regardless of the value of a. 
Thus, for any value of a the set S is linearly dependent. 

(b) Perpendicular nonzero vectors are linearly independent 
Suppose that we have the linear combination 

C1V1 + c 2 y 2 + C3V3 = 0 (*) 

for some scalars C\ , c 2 , C3 . 

To show that the set is linearly independent, we must show that C\ — c 2 — C3 = 0 . 





Taking the dot product with v i , we have 


Vi • (civi + c 2 v 2 + C 3 V 3 ) = Vi • 0 = 0. 


Distributing v 1 , we have 


C 1 V 1 • vi + c 2 v 1 • v 2 + C 3 V 1 • v 3 = 0 . 
Since the dot products Vi • v 2 = 0, Vi • V 3 =0,we have 

II ||2 n 

Ci||Vi|| = C 1 V 1 • Vi = 0. 

Since the vector v 1 7 ^ 0, the length 11 v| | 7 ^ 0 . 

Thus, we must have c\ — 0. 

We repeat the above argument with v 1 replaced by v 2 as follows. 

Since C\ — 0, the (*) becomes 


c 2 v 2 + C3V3 = 0 . 


We have 


0 = v 2 • 0 = v 2 • (c 2 v 2 + C3V3) 
= C 2 v 2 • v 2 + c 3 v 2 • v 3 . 


Since v 2 • V 3 = 0, we have 

0 = C 2 V 2 • V 2 = C 2 11 V 2 11 2 . 

Since x 7 ^ 0 , we have 11 v 2 11 7 ^ 0, and thus c 2 = 0. 

Hence (*) is now just C 3 V 3 = 0, and since X 3 7 ^ 0, we conclude that C 3 = 0. Therefore, we have 
shown that C\ — c 2 = C3 = 0, and thus the set {vi, v 2 , V 3 } is linearly independent. 


Determine whether the following set of vectors is linearly independent or linearly dependent. If the set is linearly 
dependent, express one vector in the set as a linear combination of the others. 



Solution. 

Consider the linear combination 


■ 1 ' 


'1 


■-1' 


■-2' 

0 

+ X2 

2 

+ X3 

-2 

0 

+ X4 

-2 

-1 

3 

7 

. 0 . 


.4. 


. 1 . 


. 11. 




















with variables Xi,X 2 ,X 3 ,X 4 . 

We determine whether there is(xi,X 2 ,X 3 ,X 4 ) 7 ^ (0,0, 0,0) satisfying the linear combination (*). 
The linear combination (*) is written as the matrix equation 


‘ 1 1-1 - 2 " 


Xi 

0 2 - 2-2 


*2 

-13 0 7 


*3 

. 0 4 1 11. 


-X4_ 


To find the solutions of the equation, we apply the Gauss-Jordan elimination. 

The augmented matrix [A | 0] can be reduced by elementary row operations as follows. 
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0 
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-1 
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1 
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0 
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-1 
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0 
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-1 
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0 1 
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-s 4 

0 

1 

-1 

-1 
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3 

9 
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0 

0 

1 

3 
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0 0 

5 

15 
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0 

0 

1 

3 
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’ 1 
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0 

-1 

0" 

0 

1 

0 

2 

0 

0 

0 

1 

3 

0 

1- 
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0 

0 

0 

0 _ 


Thus, the general solution is given by 


Xi = X4 
X 2 = —2x4 
X 3 = - 3 x 4 , 


where x 4 is a free variable. 


IfweletX 4 = 1, then we have a nonzero solutionx 1 = 1 ,X 2 = — 2 ,X 3 = — 3 ,X 4 = 1. 
Thus the set is linearly dependent. 

Substituting these values into (*), we have 
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1 


-1 


-2 

0 

-1 

- 2 

2 

3 

- 3 

-2 

0 

+ 

-2 

7 

. 0. 


.4. 


. 1. 


. 11. 


Solving this for the last vector we obtain the linear combination 


































■- 2 ' 


‘ 1 ' 


V 


‘-I' 

-2 

= _ 

0 

+ 2 

2 

+ 3 

-2 

7 


-1 

3 


0 

. 11. 


. 0 . 


.4. 


. 1 . 


Find the value(s) of h for which the following set of vectors 


/ 

< Vi = 

" 1 " 

0 

,v 2 = 

In 

1 

,v 3 = 

1 

2 h 

\ 

> 

c 

.0. 


_-h_ 


.3/1+1. 

> 


is linearly independent. 
(Boston College) 


Solution 1. 

Let us consider the linear combination 

JC1V1 +x 2 v 2 +X3V3 = 0 . (*) 

If this homogeneous system has only zero solution Xi=X2=X3=0, then the vectors v 1 , v 2 , v 3 are 
linearly independent. 

We reduce the augmented matrix for the system as follows. 


" 1 

h 

1 

0 " 

R 3 -\-hR 2 
-» 

" 1 

h 

1 

0 " 

0 

1 

2 h 

0 

0 

1 

2 h 

0 

0 

—h 

3/i + l 

0 


0 

0 

2 h 2 + 3/i + l 

0 


From this, we see that the homogeneous system (*) has only the zero solution if and only if 

2h 2 + 3h + 1 y 0. 


Since we have 


2h 2 + 3h + 1 = (2 h + 1 )(h + 1), 

if h ^ — 1, then 2h 2 + 3h + 1 ^ 0 . 

In summary, the vectors v 1 , v 2 , v 3 are linearly independent for any h except — , — 1. 

Solution 2. 

Note that the vectors v 1 , v 2 , v 3 are linearly independent if and only the matrix 


A 


[V1V2V3] 


I 

0 

0 


h 1 

1 2 h 

-h 3h + 1 


is nonsingular. 


Also, the matrix A is nonsingular if and only if the determinant det (A) is nonzero. 

So we compute the determinant of the matrix A by the first column cofactor expansion and obtain 




























det(^4) = 


1 h 1 
0 1 2 h 

0 -h 36 +1 

1 2 h 

-h 36 + 1 


= 26 2 + 36 + 1 
= (26 + l)(6 + l). 


Hence det(^4) 0 if and only if 6 j, — 1 . 

Therefore, the vectors v i , v 2 , v 3 are linearly independent for any values of h except — j , — 1. 


Q Let 


" 1 " 


" 1 " 


"O' 

2 

to 

II 

a 

II 

CO 

> 

4 

_ 0 „ 


_5_ 


_b_ 


be vectors in M 3 . Determine a condition on the scalars a, b so that the set of vectors {vi, V2, V3 } is linearly 
dependent. 


Solution. 

Consider the equation 

XjVj +x 2 v 2 +x 3 v 3 = 0, 
where 0 is the three-dimensional zero vector. 

Our goal is to find a condition on a, b so that the above equation has a nontrivial solution x 1? x 2 , x 3 . 
This equation is equivalent to the 3 x 3 homogeneous system of linear equations 


■1 1 0" 


■*i' 

2 a 4 


*2 

0 5 6 


*3 


We solve the system by Gauss-Jordan elimination. 
The augmented matrix of the system is 


"110 0 " 
2 a 4 0 

0 5 6 0 

Applying elementary row operations, we have 


















-1 

1 

0 

0" 

R 2~ 2R 1 

"1 

1 

0 

0" 

2 

a 

4 

0 


0 

a ~ 2 

4 

0 

0 

5 

b 

0 


0 

5 

b 

0 


■ 1 

1 

0 

O ' 

r 2 <^>r 3 

■1 

1 

0 

0 - 

0 

a — 2 

4 

0 

-► 

0 

1 

b/5 

0 

0 

1 

b/5 

0 


0 

a - 2 

4 

0 



'I 

1 

0 

O' 

R 3 -( a~2)R 2 

0 

1 

b/5 

0 




b(a-2) 



0 

0 

4 

0 


b(a-2) b(a-2) 

If the (3, 3) entry 4--— of the last matrix is 4--— is zero, then we obtain a matrix in 

echelon form 

"11 oo- 

0 1 Z?/5 0 

0 0 0 0 

This implies that x 3 is a free variable, hence the homogeneous system has a nonzero solution x 1? x 2 , *3 
. Hence in this case the set {v 1? v 2 , v 3 } is linearly dependent. 


b(a- 2 ) 

On the other hand, if 4--— ^ 0, we divide the third row by this number and obtain 


11 0 01 
0 1 b/5 0 

0 0 10 


and from this we see that the solution is x l = x 2 = x 3 = 0. 
Thus in this case the set {v 1? v 2 , v 3 } is linearly independent. 


b(a-2) 

In conclusion, the set {v 1? v 2 , v 3 } is linearly dependent if and only if 4--— = 0. 

Thus, the condition on a , b is 


b(a - 2) = 20. 


Related Question. 





















Problem. 


Determine conditions on the scalars a, b so that the following set S of vectors is linearly 
dependent. 

S= {Vi,v 2 ,v 3 }, 


where 



r 


'I ' 


-o- 

V 1 = 

3 

1 

> v 2 = 

a 

4 

> v 3 = 

2 

b 


Determine conditions on the scalars a, b so that the following set S of vectors is linearly dependent. 

S = {vi,v 2 ,v 3 }, 


where 



"1" 


"1" 


"o' 

Vl 

3 

,v 2 = 

a 

,v 3 = 

2 


_ 1 _ 


_ 4 _ 


6 


Solution. 


Let us consider the linear combination 


X 1 V 1 + x 2 v 2 +X 3 V 3 = 0. 

We want to find conditions on a, b so that there is (x 1 , x 2 , x 3 ) 7 ^ (0,0,0) satisfying this linear 
combination. 

Since the linear combination can be written as 


where 


Ax = 0, 


A — [v 1 , v 2 j v 3 ] 


"1 

1 

O' 


Xi 

3 

a 

2 

, and x = 

^2 

.1 

4 

b _ 


_^3_ 


our goal is the same as finding conditions on a, b so that A is a singular matrix. 


We apply elementary row operations to the augmented matrix \A | 0] . 



















[A\0] = 


1 1 0 

3 a 2 
1 4 b 


0 

0 

0 



’ 1 10 

0 " 


’ 1 10 

0 " 

R3-R1 



R2 ^Rs 



-» 

0 a- 3 2 

0 

-» 

0 3 b 

0 


0 3 b 

0 


0 a- 3 2 

0 


" 1 10 

0 " 

Rs — (a—3)R2 
-> 

"11 0 

0 " 

0 1 bf 3 

0 

0 1 b/S 

0 

0 a — 3 2 

0 


0 0 2 - (a - 3)b/3 

0 


The last matrix is in echelon form. 


Let c = 2 — (a — 3)6/3 be the number in (3, 3) -entry. 

If c 0, then it is easy to see that we can further reduce the matrix to 


1 

0 

0 

0 " 

0 

1 

0 

0 

O 

_1 

0 

1 

0 


and the only solution is x i = x 2 = x 3 = 0 . 


On the other hand, if c — 0, then the last row becomes a zero row and thus we see that x 3 is a free 
variable. Hence the system has a nonzero solution. 

Therefore, the vectors v 1 , V 2 , V 3 are linearly dependent if and only if c — 0. 

Hence the condition we are looking for is 

c = 2 - (a - 3)6/3 = 0 , 


or equivalently, 


(a — 3 )b — 6 . 


Related Question. 


Problem. 

Let 

Vi 

be vectors in R 3 . 


'1 


V 


"O' 

2 

<1 

to 

II 

a 

II 

CO 

> 

4 

. 0 . 


.5. 


_b_ 






























Determine a condition on the scalars a, b so that the set of vectors {v i, V 2 , V 3 } is linearly 
dependent. 


The solution is given in the post 1 

Determine a Condition on a, b so that Vectors are Linearly Dependent 


(a) Prove that the column vectors of every 3 x 5 matrix A are linearly dependent. 

(b) Prove that the row vectors of every 5 x 3 matrix B are linearly dependent. 


Proof. 

(a) Prove that the column vectors of every 3x5 matrix^ are linearly dependent. 
Note that the column vectors of the matrix A are linearly dependent if the matrix equation 


Ax — 0 


has a nonzero solution x E M 5 . 

The equation is equivalent to a 3 X 5 homogeneous system. 

As there are more variables than equations, the homogeneous system has infinitely many solutions. 

In particular, the equation has a nonzero solution x. 

Hence the column vectors are linearly dependent. 

(b) Prove that the row vectors of every 5x3 matrix B are linearly dependent. 

Observe that the row vectors of the matrix B are the column vectors of the transpose B T . Note that the 
size of B T is 3 X 5 . 

In part (a), we showed that the column vectors of any 3x5 matrix are linearly dependent. 

It follows that the column vectors of B T are linearly dependent. 

Hence the row vectors of B are linearly dependent. 

Suppose M is an n X n upper-triangular matrix. If the diagonal entries of M are all non-zero, then prove that the 
column vectors are linearly independent. Does the conclusion hold if we do not assume that M has non-zero 
diagonal entries? 


Proof. 

If the diagonal entries of M are all non-zero, then prove that the column vectors are linearly 
independent. 

Let x denote an arbitrary column vector of length n , and let 0 denote the zero vector of the same size. 

The columns of M are linearly independent if and only if the only solution to the equation Mx — 0 is 
the vector x = 0 . 





Consider the upper-triangular matrix 



m 1,1 

m 1,2 

m i ?3 • 

•• m^n 


0 

m 2,2 

m 2,3 • 

•• m 2 ,n 

M = 

0 

0 

m 3,3 • 

•• m^n 


0 

0 

0 

M n,n 


The equation Mx = 0 then yields a system of linear equations with n equations and n variables. 

To find a solution, consider the augmented matrix [ [c|c]M 0 ] • 

Because M is upper-triangular, we can use back-substitution to solve. The bottom row of the augmented 
matrix gives the equation m n , n x n = 0. 

By assumption, m y^ 0 because it is a diagonal entry. Thus we must have that x n — 0. 


Next, the second-to-last row in the augmented matrix gives the equation 

m n _ \ n _i x n i +m w _i jW x w = 0. Because x n — 0 andm w _i jW _i y^ 0 , we must have that 

X „-1 = 0 . 

We continue working backward in this way to see that x / = 0 for all 1 < i < n . Thus x = 0 , and so 
the columns of M must be linearly independent. 

Does the conclusion hold if we do not assume that M has non-zero diagonal entries? 

If the diagonal entries of M could be non-zero, then the columns might be linearly dependent. Consider 
the simple example 


0 Suppose that an n X m matrix M is composed of the column vectors bi, • • •, b m . Prove that a vector v E R n 
can be written as a linear combination of the column vectors if and only if there is a vector x which solves the 
equation Afx = v. 


Proof. 

Suppose that v is a linear combination of the vectors bi,..., v m . That is, suppose there are coefficients 
suc h^at 


Xibi + X2^>2 + • • * + x m h m — V. 


Then define the column vector 

"xi 
X 2 

X = 

m 









Then x satisfies the equation Mx = v . 


On the other hand, suppose that there is a vector x which satisfies the desired equation with components 
^ 1 5 2 5 * * * ) ^ m • 


Then these components can be used to find the linear combination 

xibi + x 2 b 2 + • • • +x m b m —Mx — v. 

Hence, the vector v is a linear combination of column vectors bi,..., b m . 


Nonsingular Matrices 


Definition 


Let A be an n X n matrix. 

A is nonsingular if the only solution to Ax = 0 is the zero solution x = 0. 


Summary 


Let A be an n X n matrix. 

If A is nonsingular, then A T is nonsingular. 

A is nonsingular if and only if the column vectors of A are linearly independent. 

Ax = b has a unique solution for every n X 1 column vector b if and only if A is nonsingular. 


Problems 


O 


Determine whether the following matrices are nonsingular or not. 


(a) A = 


(b )B = 


'10 1 " 
2 12 

1 0 -1 

"2 1 2 ] 

10 1 . 
4 14 


Solution. 

Recall that an n X n matrix is nonsingular if Ax — 0 has only the zero solution. This is equivalent to the 
condition that the rank of A is n. 


(a) Is A = 

'10 1 " 

2 12 

nonsingular? 


1 

o 



We find the rank of the matrix A as follows. 



'1 

0 

1 " 


'1 

0 

1 ' 

1 D 

'1 

0 

1 " 


'1 

0 

o' 

A = 

2 

1 

2 

R 2 -2Rl 
-» 

0 

1 

0 

~2 R * 
-» 

0 

1 

0 

R 1 R 3 
-» 

0 

1 

0 


1 

0 

-1 

^ 3—^1 

0 

0 

-2 


0 

0 

1 


0 

0 

1 


































The last matrix is in reduced row echelon form and has three nonzero rows. 
Hence, the rank of A is 3. 

It follows that the matrix A is nonsingular. 



'2 1 2" 


(b) Is B = 

1 0 1 

.4 1 4. 

nonsingular? 


Next, we compute the rank of B as follows. 


"2 

1 

2' 

Rl^R 2 

"1 

0 

r 

R 2 ~2Ri 

'i 

0 

i 

R 3 R 2 

'1 

0 

r 

1 

0 

1 

-» 

2 

1 

2 

- > 

0 

l 

0 

-» 

0 

1 

0 

.4 

1 

4. 


.4 

1 

4. 

Rs—4:Ri 

.0 

i 

0 . 


.0 

0 

0 . 


This computation shows that the rank of B is 2. 
Hence, the matrix B is singular. 


A 

Consider the matrix M 


1 4 
3 12 


(a) Show that M is singular. 

(b) Find a non-zero vector v such that Mv 


0 , where 0 is the 2-dimensional zero vector. 


Solution. 

(a) Show that M is singular. 


To see that M is singular, we will row-reduce the augmented matrix: 


1 

4 

0 

R 2 -3Ri 
-» 

l 

4 

0 

3 

12 

0 


0 

0 

0 


The row-reduced matrix has a row of all 0’s, and hence is singular. 


(b) Find a non-zero vector v such that Mv = 0. 


We see that the vector v = 


Vl 

lv 2 ] 


must satisfy the equation tq + 4^2 = 0. 

There are many solutions. One solution, for example, is found by setting V\ — 1 and solve v 2 =- 


^ Let A be the following 3 x 3 matrix. 


A = 


i i -r 
0 12 
1 1 a 


Determine the values of a so that the matrix A is nonsingular. 


Solution. 


We use the fact that a matrix is nonsingular if and only if it is row equivalent to the identity matrix. 





























We apply the elementary row operations as follows. 

R 3 —R 1 

A -» 

If a + 1 = 0, then the last matrix is in reduced row echelon form. 
Thus A is not row equivalent to the identity matrix. 


1 

1 

-1 

Rl — i?2 
- > 

1 

0 

-3 

0 

1 

2 

0 

1 

2 

.0 

0 

a T- 1 _ 


.0 

0 

CL -hi 


On the other hand, if a + 1 7 ^ 0, then we can continue the reduction as follows. 


'i 

0 

-3 ' 


'1 

0 

-3" 

R\ +3i?3 

'1 

0 

o' 

0 

1 

2 

-* 

0 

1 

2 

-» 

0 

1 

0 

.0 

0 

a - hi 


.0 

0 

1 _ 

i?2 — 2i?3 

.0 

0 

1_ 


Therefore A is row equivalent to the identity matrix. 

We conclude that the matrix A is nonsingular for any values of a except for a = — 1. 

Comment. 

If you know how to compute the determinant of a 3 X 3 matrix, then you may also solve this using 
the fact that a matrix is nonsingular if and only if the determinant of it is nonzero. 


o 

Determine the values of a real number a such that the matrix A 


3 

2 

0 


0 

3 

18a 


a 


0 

a + 1 


is nonsingular. 


Solution. 

We apply elementary row operations and obtain: 

A = 


R 2 -2Ri 


'3 

0 

a 

R 1 —R 2 

'1 

-3 

a 

2 

3 

0 

- > 

2 

3 

0 

.0 

18a 

Cl -hi 


.0 

18a 

Cl -h 1 _ 


'1 

-3 

a 

Rs~(2a)R2 

'1 

-3 

a 

0 

9 

—2a 


0 

9 

—2a 

? 

.0 

18a 

a -h 1 _ 


.0 

0 

4a 2 "T a -T 1 


From this, we see that the matrix A is nonsingular if and only if the (3, 3) -entry 4a 2 + a + 1 is not 
zero. 

By the quadratic formula, we see that 


a — 


-1± y^l5 
8 


are solutions of4a 2 +a + l = 0. 

Note that these are not real numbers. Thus, for any real number a, we have 4 a 2 + a + 1 0. 
































Hence, we can divide the third row by this number, and eventually we can reduce it to the identity matrix. 
So the rank of A is 3, and A is nonsingular for any real number a. 


(a) Suppose that a 3 X 3 system of linear equations is inconsistent. Is the coefficient matrix of the system 
nonsingular? 

(b) Suppose that a 3 X 3 homogeneous system of linear equations has a solution x\ — 0, a ? 2 = — 3, £3 = 5. Is 
the coefficient matrix of the system nonsingular? 


(c) Let A be a 4 X 4 matrix and let v = 


matrix A nonsingular? 


" 1 " 


"4" 

2 

and w = 

3 

3 


2 

_4_ 


_ 1 _ 


. Suppose that we have Aw — Aw. Is the 



Recall the following fact. 
Let A be an n X n matrix. 


Then Ax. — b has a unique solution for every n X 1 column vector b if and only if A is 
nonsingular. 

Solution. 

(a) Suppose that a 3 X 3 system of linear equations is inconsistent. Is the coefficient matrix of 
the system nonsingular? 

Let Ax — b be the system, where A is the coefficient matrix and b is the constant term vector. 
Because this system is inconsistent, then A is singular. (If A would be nonsingular, then the system 
has a unique solution.) 

(b) Suppose that a 3 X 3 homogeneous system of linear equations has a solution 
X\ = 0, £2 = —3, £3 = 5. Is the coefficient matrix of the system nonsingular? 

Recall that a homogeneous system of linear equations has always the zero solution. As the system has 
another solution x\ — 0, X2 = — 3, X3 = 5, the system Ax — 0 must have infinitely many 
solutions. Here A is the coefficient matrix. 

Thus, the coefficient matrix A is singular. (If A would be nonsingular, the system has only one 
solution, which must be the zero solution.) 

(c) Suppose that we have Aw = Aw. Is the matrix A nonsingular? 

Because Av — Aw, we have 



Note that 


A(w - w) = Aw - Aw = 0. 














is a nonzero vector. 

This implies that the homogeneous system Ax — 0 has infinitely many solutions as we have found a 
nonzero solution. 

Hence, A is singular. 


Let A be a 3 X 3 singular matrix. Then show that there exists a nonzero 3x3 matrix B such that 

AB — O, 

where O is the 3x3 zero matrix. 


Proof. 

Since A is singular, the equation Ax — 0 has a nonzero solution. 
Let x i be a nonzero solution of Ax = 0 . 

We define the 3 X 3 matrix B to be 


B = [xi,0,0], 

that is, the first column of B is the vector x \ and the second and the third column vectors are 3- 
dimensional zero vectors. 

Then since x i 0, the matrix B is not the zero matrix. 

We have 

AB =4xi, 0,0] 

= [Ax u A0,A0\ 

= [ 0 , 0 , 0 ] = 0 , 


since Ax\ = 0 . 

Thus we have found the nonzero matrix B such that the product AB — O . 

Comment. 

This is one of the midterm exam 1 problems of linear algebra (Math 2568) at the Ohio State University. 
In the exam the following hint was given: 

Hint: Let B — [xi, X 2 , X 3 ] , where X/ is the z-th column vector of B for i — 1,2,3. Then 
AB = [^xi,v4x 2 ,v4x 3 ] . 

Common Mistake 

Several students wrote “Since A is singular, A has a solution”. 

This does not make sense. A solution of what? You need to remember the definition correctly. 

A matrix A is singular if the equation Ax — 0 has a nonzero solution x. 



Also, note that x is a vector, not a number. 


O 


Let A be an n X n singular matrix. Then prove that there exists a nonzero n X n matrix B such that AB — O, 
where O is the n X n zero matrix. 


Definition. 

Recall that an n X n matrix A is called singular if the equation 


Ax = 0 


has a nonzero solution x £ R n . 

Proof. 

Since A is singular, there exists a nonzero vector b £ R n such that 


Ah = 0 . 


Then we define the n X n matrix B whose first column is the vector b and the other entries are zero. 
That is 


B=[ b 0 ••• 0]. 

Since the vector b is nonzero, the matrix B is nonzero. 

With this choice of B , we have 


AB = A[ b 0 ••• 0] 

= [Ab AO ••• AO] 

— [o 0 ... o]=o. 

Hence we have proved that AB — O with the nonzero matrix B. 


^ Let Vi and V2 be 2 -dimensional vectors and let A be a 2 X 2 matrix. 

(a) Show that if Vi, V2 are linearly dependent vectors, then the vectors Av \, A.V2 are also linearly dependent. 

(b) If Vi, V2 are linearly independent vectors, can we conclude that the vectors Av \, AV2 are also linearly 
independent? 

(c) If Vi, V2 are linearly independent vectors and A is nonsingular, then show that the vectors Av\ , A.V2 are also 
linearly independent. 

Proof. 

(a) If Vi , V 2 are linearly dependent, then Av\, Aw 2 are linearly dependent. 

Since Vi, V 2 are linearly dependent vectors, there exists (ci ,02 ) A (0,0) such that 


C1V1 + c 2 v 2 = 0 . 


Using this equality, we obtain 


0 = AO 

= A(ci\i + c 2 v 2 ) 

= A{c ivi) + A(c 2 v 2 ) 

= ci ( Av \) + c 2 {Aw2 ) (since ci, c 2 are scalars). 

Hence we have 


ci(Avi) + c 2 (iv 2 ) = 0 

with (ci,c 2 ) 7^ (0,0). 

This implies that the vectors Avr, Av 2 are linearly dependent. 


(b) If Vi, v 2 are linearly independent, can we conclude that Av\ , Av 2 are linearly 
independent? 


The answer is no. In general, even though Vi, v 2 are linearly independent vectors, the vectors 
Av \, Av 2 might be linearly dependent. Let us give an example. 


Letvi = 




Then it is straightforward to see that these vectors are linearly independent. 
Let 


A = 


0 

0 


0 

0 


be the 2 X 2 zero matrix. 


Then we have 


Av\ = 


,Av 2 = 


and clearly Aw \, Aw 2 are linearly dependent vectors. 


0 

0 


(c) If Vi, v 2 are linearly independent and A is nonsingular, then Av\ , -Av 2 are linearly 
independent. 

Consider the linear combination 


ci(Avi) + c 2 (A\ 2 ) = 0 . 

To show that Aw i, j4v 2 are linearly independent, we show that ci — c 2 =0. 

The above equality can be written as 

A(c\V\ + c 2 v 2 ) = 0 . 

Since the matrix A is nonsingular, the homogeneous equation Ax. — 0 has only the zero solution. 
Therefore we have 


C1V1 + c 2 v 2 = 0 . 

(Another reasoning here is that since the matrix A is nonsingular, it is invertible, and thus we have the 
inverse matrix A ~ l . Multiplying by A~ l on the left, we obtain the same equation.) 














Now, since Vi, V 2 are linearly independent by assumption, it follows that C\ — C 2 = 0. 
Hence we conclude that the vectors Av \, Av 2 are linearly independent. 


^ Let A be an n X n matrix. Suppose that the sum of elements in each row of A is zero. Then prove that the matrix 
A is singular. 


Definition. 

Ann X n matrix A is said to be singular if there exists a nonzero vector v such that A v = 0 . 
Otherwise, we say that A is a nonsingular matrix. 


Proof. 


Let v = 


be the n -dimensional vector all of whose entries are 1. 


Then we compute Ay . 


Note that Ay is an n -dimensional vector, and the z-th entry of Ay is the sum of the z-th row of the matrix 
A for z = 1,... ,zz . 

It follows from the assumption that the sum of elements in each row of A is 0 that we have 


Av = 0. 


As v is a nonzero vector, this equality implies that A is a singular matrix. 


Example 


To illustrate the proof, let us consider a concrete example. 

"1-4 3 

Letv4 =25-7 

_3 0 -3. 

Then the sum of each row of A is zero. 


Now we compute 


'1-4 3 ' 


T" 


'1 + (-4) +3' 


"O' 

2 5-7 


1 

= 

2 + 5 + (-7) 

= 

0 

.3 0 -3. 


.1. 


_3 + 0 + (—3)_ 


.0. 


This shows that the matrix A is singular because we have Ay = 0 for a nonzero vector v. 


Prove that if n X n matrices A and B are nonsingular, then the product AB is also a nonsingular matrix. 
(The Ohio State University) 
















Definition (Nonsingular Matrix) 

An n X n matrix is called nonsingular if the only solution x £ J$L n of the equation Ax = x is x = 0 . 
Proof. 

We give two proofs. The first one uses a property of the determinants of matrices, and the second one uses 
the definition of nonsingular matrices. 

Proof 1. (Using Determinant) 

Recall that a matrix is nonsingular if and only if its determinant is not zero. 

Since A and B are nonsingular, we know that 

det(^4) 7 ^ 0 and det(5) ^ 0 . 

Then we have using the multiplicative property of the determinant 

det(v45) = det(v4) det(5) ^ 0. 

Since the determinant of the product AB is not zero, we conclude that AB is a nonsingular matrix. 

Proof 2. (Using Definition of Nonsingular Matrices) 

Suppose that A, B are nonsingular matrices. 

This means that if Ax = 0 for some the vector x £ J$L n , then we must have x = 0 . 

Same for B . 

Suppose that we have ( AB)x — 0 for some vector x £ WL n . 

Let v = Bx £ R” . Then we have 


Av = ABx = 0. 


So since A is a nonsingular matrix, we have v = 0 , namely, Bx = 0 . 

Since B is nonsingular, this further implies that x = 0 . 

In summary, whenever ( AB)x — 0 , we have x = 0 . 

Therefore, the matrix AB is nonsingular. 


(a) Let A — ( dij ) be an n X n matrix. Suppose that the entries of the matrix A satisfy the following relation. 

I > kill H-f kii-ll + kii+lH-1" kin I 

for all 1 < i < n. Show that the matrix A is nonsingular. 

(b) Let B — (bij) be an n X n matrix whose entries satisfy the relation 

|6ii| = 1 and \bij \ < ——- 

for all i and j with i ^ j. Prove that the matrix B is nonsingular. 

(c) Determine whether the following matrix is nonsingular or not. 



7T e e 2 /2-7r 2 

C = e 2 /2 -k 2 7 r e 

e e 2 /2-7r 2 7r 

where 7T = 3.14159 ..and e = 2.71828 ... is Euler’s number (or Napier’s constant). 



0 For (a), assume that the matrix A is nonsingular. 

Then the system of linear equation Ax — 0 has nonzero solution. 


0 For (b), apply (a). 


0 For(c), use part (a). Note e 2 /2zr 2 = 0.37433 ... . 


Proof. 

(a) Show that the matrix A is nonsingular. 

Suppose that the matrix A is nonsingular. Then the system of linear equations 

011*1 H-+ ai n x n = 0 

0/21 *1 + * • * + 0/272 * 72 0 

has nontrivial solution (x i,. . ., x n ) 7 ^ (0,. . ., 0) . 

Since not all of x / are zero, let x k be the nonzero number such that |x k | is the largest among |x / 
That is, 

\xk\ — max lxj I > 0. 

1 < 2<72 


Then from the k-th equation in the system, we have 

—dkkXk — &k\X 1 + * * * + dkk-lXk-l + Ukk+lXk+l + * * * + ^kn x n . 
Taking the absolute values of both sides and by the triangle inequality, we have 

|Ukk ||*fc| — |0H*1 + * * * + CLkk-lXk-1 + Mkk+lXk+1 + * * * + 0 kn x n \ 

< \dkl 1 1 * 1 1 + * * * + \cikk-l | \%k-l | + \ a kk+l 1|*fc+l | + ' * * + \dkn \ \ x n 


Since x k is nonzero, we divide by |x k | and we have 






| @kk | ^ |&kl 


Fi| 

1**1 


+ • • • + \akk-i 


\xk -1 

1**1 


+ \dkk+l | 


l**+i| 

I** I 


+ * * * + 



By the choice of , the absolute value of is maximal, hence |x z -1 / |x k | < 1 for all i. 
Therefore we have 


Wkk | < l^il | + *-* + Wii-l | + |^i i+l | + **- + Win I* 

However this contradicts the given condition. Therefore the matrix A is nonsingular. 

(b) Prove that the matrix B is nonsingular. 

Note that 


l^z'i ! + ••• + \ba-i | + \ba+i | + 

1 1 n - 1 

< -- + ••• + 


| h 


n — 1 


n — 1 


n — 


Y = 1 = 1 ^ 1 - 


Therefore the matrix 5 satisfy the condition of part (a). Therefore from the result of (a), the matrix B is 
nonsingular. 

(c) Determine whether the matrix is nonsingular or not. 

Let us apply part (a) to the matrix C . 

We have 

e 2 

— = 0.37433.... 

2tt 2 

Thus we calculate 

e 2 

e + —- = 3.09262 • - • < n = 3.14159... 

2 7T 2 

Thus the sum of the (absolute value of) off-diagonal entries are less than the diagonal entry. Hence by part 
(a), the matrix C is nonsingular. 

Comment. 

These are simple and interesting criteria for the non singularity of a matrix. 

You may easily cook up matrices satisfying (or not satisfying) the conditions. 

Why don’t you make one and give it to your friend for a math quiz? 


Let A be an n X n matrix. Suppose that y is a nonzero row vector such that y A = y. (Here a row vector means 
a 1 X n matrix.) Prove that there is a nonzero column vector x such that Ax. — x. (Here a column vector means 
an n X 1 matrix.) 


Proof 1 .(Without the theory of eigenvalues) 

Let / be the n X n identity matrix. Then we have 












Oixh = yA - y = y(A - I), 


where 0 \ xn is the row zero vector. 

Taking the transpose, we have 

Onxi = 0j- x „ = (y(A —/)) T 

= (^-hV. 

T 1 

Since the vector y is nonzero, the transpose y is a nonzero column vector. 
Thus, the above equality yields that the matrix (A — /) T is singular. 

It follows that the matrix A — I is singular as well. 

Hence there exists a nonzero column vector x such that 

(A -I)x = 0 „xi, 


and consequently we have 


Ax = x 


for a nonzero column vector x. 


Proof 2 . (Using the theory of eigenvalues) 

Taking the conjugate of the both sides of the identity yA — y , we obtain 

A T y T =y T . 

T 1 

Since y is a nonzero row vector, y is a nonzero column vector. 

It follows that 1 is an eigenvalue of the matrix A T and y T is a corresponding eigenvector. 

rr\ 

Since the matrices A and A has the same eigenvalues , we deduce that the matrix A has an eigenvalue 1. 
(See part (b) of the post “Transpose of a matrix and eigenvalues and related questions/ 4 .) 

Let x be an eigenvector corresponding to the eigenvalue 1 (by definition x is nonzero). Then we have 


Ax = x 


as required. 


^ Without using properties of determinants 

Determine whether each of the following statement is True or False. 

(a) Suppose that A and B are nonsingular n X n matrices. Then A + B is nonsingular. 

(b) If a square matrix has no zero rows or columns, then it has an inverse matrix. 

(c) Let A be an m X n matrix. 

If the equation ^4x = 0 has only the trivial solution x E M n , then the columns of A are linearly independent. 

(d) Let A be an m X n matrix. 

If the equation Ax — 0 has only the trivial solution x E M n , then the rows of A are linearly independent. 

(e) The row echelon form of an 3 X 3 matrix is invertible. 

(f) There is a non-zero nonsingular matrix A such that A 2 = O. 

(g) If A and B are invertible n X n matrices, then AB — BA. 

(h) If A and B are n X n nonsingular matrices such that A 2 — I and B 2 — /, then ( AB) -1 = BA. 

(i) If A is an m X n matrix such that Ax — 0 for every vector x in R n , then A is the m X n zero matrix. 


(j) Let A be a 2 X 2 nonsingular matrix and let Vi and V 2 be linearly independent vectors in R 2 . 
Then the vectors Avi , AV 2 are linearly independent vectors in R 2 . 


□ 


G Let A be an n X n nonsingular matrix. Prove that the transpose matrix A T is also nonsingular. 


Definition (Nonsingular Matrix). 

By definition, A T is a nonsingular matrix if the only solution to 

A T x — 0 


is the zero vector x = 0 in R” . 

Proof. 

Suppose that 


T v = 0 


for an n -dimensional vector v. 
We prove that v = 0 . 


Since A is nonsingular, there exists a vector uGf such that 

Am — v. 

(For those who know the inverse matrix, the vector u is given by u = A _1 v . The fact can be proved 
without using the inverse matrix, though.) 

Hence we obtain from (*) 

A t Au = 0. 


It follows that 


u t A t Au = u T 0 = 0. 


By the property of the transpose, we have 

u T A T = (^4u) T . 


Thus we have 

0 = (AufAu = pu|| 2 

This yields the length pu|| = 0 , and hence A u = 0 . 

Since v = Am , we conclude that v = 0 . 





Therefore if A T v = 0 , then v = 0 . 

This shows that the transpose A T is nonsingular. 


Let A be an n X [n — 1) matrix and let b be an (n — 1) -dimensional vector. Then the product Ah is an n- 
dimensional vector. Set the n X n matrix B — [Ai, A 2 , • • •, A n -i,Ah], where A{ is the l -th column vector of 
A. Prove that B is a singular matrix for any choice of b. 


Definition/Hint. 

An ft X ft matrix B is nonsingular ifv4x = 0 ,xGK” implies that x = 0 . 
Otherwise, the matrix B is called singular. 

Namely, the matrix B is singular if there exists a nonzero vector v such that By — 0 . 


You may use the fact that a matrix is nonsingular if and 

We give two proofs. The first one uses the definition of 
mentioned above. 

Proof (Using Defition). 

Let 


only if its column vectors are lienarly independent, 
a singular matrix. The second one uses the fact 


bi 

k> 

_ b n i _ 


Then we have 


Ah — b\A i + 2 + * * * + b n -\A n -\ 

using the column vectors A / of A . 


(*) 


Let us define the ft -dimensional vector v to be 

bi 

b 2 

v = • 

bn —1 

-1 


Since the last entry of v is —1, the vector v is nonzero. 
We calculate 







h 

bz 


By = [A 1 ,A 2 ,...,A n _ 1 ,Ab\ 


bn —1 

-1 


— b\A i + bzA 2 + • • • + b n -\A n - 1 + (— l)/4b 


(*) 

= b\A i + 2 + • • * + b n -\A n -\ — ( b\A i + Z?2v4 2 + * * * + b n -\A n -\) 

= 0 , 


where 0 is the n -dimensional zero vector. 


Since we have Bv = 0 for a nonzero vector v, we conclude that the matrix B is singular. 

Proof (Using the Fact). 

Note that a square matrix is singular if and only if its columns vectors are linearly dependent. 

We prove that the column vectors A \ , A 2 , • • •, A n -\ , Ah of the matrix B are linearly dependent. 


Let 


Then we have 


b\ 

b2 

bn —1 


Ah — b\A 1 + Z?2v4 2 + * * • + b n -\A n -\ . 

Thus, we have the linear combination of column vectors of B 

b\A 1 + Z? 2^4 2 + * * * + b n -\A n -\ — Ah — 0 . 

Since the coefficient in front of the vector Ah is —1, the left hand side is a nontrivial linear combination 
of column vectors of B . 

This implies that the column vectors of B are linearly dependent, hence the matrix B is singular. 


€> Ann X n matrix A is called nonsingular if the only vector x E M n satisfying the equation Ax = 0 is x = 0. 
Using the definition of a nonsingular matrix, prove the following statements. 

(a) If A and B are n X n nonsingular matrix, then the product AB is also nonsingular. 

(b) Let A and B be n X n matrices and suppose that the product AB is nonsingular. Then: 

1. The matrix B is nonsingular. 

2. The matrix A is nonsingular. (You may use the fact that a nonsingular matrix is invertible.) 


Proof. 

(a) If A and B are ft X n nonsingular matrix, then the product AB is also nonsingular. 






Let vGf and suppose that ( AB)x — 0 . 
Our goal is to prove that x = 0 . 

Let y := Bx G R” . Then we have 


Ay = A(Bx ) = (AB)x = 0. 

Since A is nonsingular, this implies that the vector y = 0 . 

Hence we have y = Bx = 0 . 

Since B is nonsingular, this further implies that x = 0 . 

It follows that if (AB)x — 0 , then we must have x = 0 . 

By definition, this means that the matrix AB is nonsingular. 

(b)-1. If AB is nonsingular, then B is nonsingular. 

Suppose that Bx = 0 . We prove that x = 0 . 

Since Bx = 0 , it yields that 

(AB)x = A(Bx) =,40 = 0. 


As the matrix AB is nonsingular, it follows from ( AB)x — 0 that x = 0 . 

This proves that the matrix B is nonsingular. 

(b)-2. If AB is nonsingular, then A is nonsingular. 

By part (1), we know that B is nonsingular, hence it is invertible. 

The inverse matrix B -1 and the matrix AB are both nonsingular. 

Hence it follows from part (a) that the product of AB and B -1 is also nonsingular. 
Thus, 

A = [AB)B~ 1 


is a nonsingular matrix. 

Nonsingular if and only if Invertible 

For the proof of the fact we used in the proof of (b)-2 that a matrix is nonsingular if and only if it 
invertible, see the posLr 

A Matrix is Invertible If and Only If It is Nonsingular 


^ Let A be a singular n X n matrix. 
Let 


T—1 


"0" 


"0" 

0 

5 e 2 = 

1 

,..., e n — 

0 

1 

• O 
_l 


1 

o • 


_1_ 


be unit vectors in R n . 









Prove that at least one of the following matrix equations 

Ax — Gi 

for i — 1, 2,..., n, must have no solution x E R n . 

Proof. 

Assume on the contrary that each matrix equation Ax — e / has a solution. 

Let bf E MA be a solution of Ax — e z for each i — 1 ,..., n . 

That is, we have 

AId i — 6 i. 

Let B = [bi,b 2 ,...,b w ] be the n X n matrix whose z-th column vector is b /. 


Then we have 

AB = ^[bi,b 2 ,...,b„] 

= [Abi,Ab2, • • • ,Ab„] 
= [ei,e 2 , • • • ,e B ] = I, 


where / is the n X n identity matrix. 


Since / is the nonsingular matrix, the matrix A must also be nonsingular. 
However this contradicts the assumption that A is singular. 

It follows that at least one of the matrix equations Ax — e z - has no solution. 


0 Using the numbers appearing in 

7r = 3.1415926535897932384626433832795028841971693993751058209749... 

we construct the matrix 


3 

14 

1592 

65358 

97932 

38462643 

38 

32 

7950 

2 

8841 

9716 

939937510 

5820 

974 

9 


Prove that the matrix A is nonsingular. 


Proof. 

To show that the matrix A is nonsingular, it suffices to prove that det(^4) 0 . 

One way is to compute the determinant of A directly. 

However, as the numbers inv4 are quite large for hand computation, the direct calculation must be tedious. 





So we consider an alternative method. 

Note that we do not have to find the exact value of det (A) , but we just need to know det (A) ^ 0 . 

Thus, it suffices to show that det (A) is odd. (0 is an even number.) 

This suggests that considering the matrix modulo 2 is helpful. 


Letv4 be the matrix whose (ij) -entry is the (ij) -entry ofv4 modulo 2. 
That is, 


A 


10 0 0 
0 10 0 
0 0 10 
0 0 0 1 


(Remark that the diagonal entries of A are odd, and off-diagonal entries are even.) 

Since det(yf) is a polynomial of entries ofv4 , we have 

det (A) = det (A) (mod 2) 

= 1 . 

It follows that det (A) is odd, and in particular det (A) ^ 0 . 

Thus the matrix A is nonsingular. 

What’s det (A) anyway? 

Just for the record, the determinant of A is 

det(v4) = -20330769121541702776233175. 

Beautiful Formulas for n 

This problem was nothing to do with the number n (except we used the digits of n) and the matrix is far 
from beautiful. 

(Although the method we used is beautiful.) 

Check out the post 

Beautiful formulas for pi=3.14... 

for beautiful formulas containing n like 


n — 


1 + 


2 + 


2 + 


2 + 


2 + --. 










Inverse Matrices 


Definition 


Q Ann X n matrix A is said to be invertible if there exists an n X n matrix B such that 

AB = BA = I. Such a matrix B is unique and called the inverse matrix of A , denoted by A~ l 


Summary 


Let A, B be n X n matrices. 

A is invertible if and only if rref ([^4 \ I n ]) — [I n \ A f ] for some n X n matrix A f . In this case, 

A' = A- 1 . 

Q A is invertible if and only if A is nonsingular. 

If A, B are invertible, then (AB ) -1 = B~ l A~ l 
a b 


A 2 X 2 matrix A = 


c d 


If A is invertible, then the inverse matrix is given by A = 


is invertible if and only if the determinant det(^ 4 ) = ad — be ^ 0. 

d -b 
—c a 


l 


det(v4) 


If A is invertible, then A T is invertible and (A T ) 1 = (A 


Problems 

For each of the following 3x3 matrices A, determine whether A is invertible and find the inverse A ~ 1 if exists 
by computing the augmented matrix [A\I\ , where / is the 3 X 3 identity matrix. 

'1 3 -2" 

(a) A — 2 3 0 

0 1 -1 
"1 0 2 
(b )A= -1-3 2. 

3 6-2 


Elementary Row Operations and Inverse Matrices 

Recall the following procedure of testing the invertibility of A as well as finding the inverse matrix if 
exists. 

If the augmented matrix [A |/] is transformed into a matrix of the form [I\B] , then the matrix A is 
invertible and the inverse matrix A -1 is given by B . 

If the reduced row echelon form matrix for [A |/] is not of the form [I\B] , then the matrix A is not 
invertible. 


























Solution. 


(a) A = 


13-2 
2 3 0 

0 1-1 


We apply the elementary row operations as follows. 
We have 



' 1 

3 

-2 

1 

0 

0 " 



" 1 

3 

-2 


1 

0 

0 " 









*2' 

-2/21 









'M = 

2 

3 

0 

0 

1 

0 


- » 

0 

-3 

4 

— 

2 

1 

0 



0 

1 

-1 

0 

0 

1 



0 

1 

-1 

0 

0 

1 



" 1 


3 - 

2 

1 

0 

0 " 

Ri - SR 2 

" 1 

0 

1 


1 

0 

-3 " 

R 2 AA-R 3 








R 3 +3 R 2 








- > 

0 


1 - 

1 

0 

0 

1 


- » 

0 

1 - 

-1 


0 

0 

1 


0 


-3 

4 

-2 

1 

0 



0 

0 

1 


-2 

1 

3 


R 1 -R 3 

^ 2+^3 


1 0 0 
0 1 0 
0 0 1 


3 -1 -6 

-2 14 

-2 13 


The left 3x3 part of the last matrix is the identity matrix. 

This implies that A is invertible and the inverse matrix is given by the right 3x3 matrix. 
Hence 


A~ x = 


3 

-2 

-2 


-1 -6 

1 4 

1 3 



" 1 0 2 

(b )A = 

-1 -3 2 

.3 6-2. 


Now we consider the matrix A = 


1 

-1 

3 


0 2 
-3 2 

6 -2 


Applying elementary row operations, we obtain 


m 


1 

0 

2 

1 

0 

0 " 

R 2 ~hR 1 
R 3 SR 1 

" 1 

0 

2 

1 

0 

0 " 

-1 

-3 

2 

0 

1 

0 

- » 

0 

-3 

4 

1 

1 

0 

3 

6 

-2 

0 

0 

1 


0 

6 

-8 

-3 

0 

1 


R 3 - 2 Ri 

" 1 

0 2 

1 0 0" 


10 2 

1 0 0" 

- > 

0 

0 

-3 4 

0 0 

1 10 

-5 -2 1 

-> 

0 1 -4/3 

0 0 0 

-1/3 -1/3 0 

-5 -2 1 


The last matrix is in reduced row echelon form but the left 3x3 part is not the identity matrix /. 
It follows that the matrix A is not invertible. 


Related Question. 













































Problem. 

Find the inverse matrix of 


A = 


1 1 
0 0 
1 0 


2 

1 

1 


if it exists. If you think there is no inverse matrix of A , then give a reason. 


Let A be the matrix 

' 1-1 0 " 

0 1 - 1 . 

.0 0 1 . 

Is the matrix A invertible? If not, then explain why it isn’t invertible. If so, then find the inverse. 
(The Ohio State University) 


Hint. 

Consider the augmented matrix [A\I\ , where / is the 3 X 3 identity matrix. 

Solution. 


Consider the augmented matrix [A |/] with the 3 X 3 identity matrix /. 
Reduce [A |7] using elementary row operations as follows. 


m 


" 1 

-l 

0 

1 

0 

0 " 

0 

l 

-1 

0 

1 

0 

O 

_i 

0 

1 

0 

0 

1 


R 2+^3 




R i -\-R 2 
-» 


1 

0 

0 

1 

0 


-1 0 

1 0 

0 1 

0 0 

1 0 


1 
0 

0 0 


0 0 
1 1 


0 0 1 


111 
Oil 
0 0 1 


This is in reduced row echelon form and the left 3x3 part is the identity matrix. Hence A is invertible 
and the inverse matrix is 


A~ l = 


111 
0 11 
0 0 1 


Find the inverse matrix of 


A = 


1 1 
0 0 
1 0 


2 

1 

1 





















if it exists. If you think there is no inverse matrix of A, then give a reason. 
(The Ohio State University) 


Solution. 

To check whether the matrix A has the inverse matrix and to find the inverse matrix if exist at once, we 
consider the augmented matrix [A \ I] , where / is the 3 X 3 identity matrix. 

We apply the elementary row operations as follows. 


1 

1 

2 

1 

0 

0 " 

i?3— 

" 1 

1 

2 

1 

0 

0 " 

0 

0 

1 

0 

1 

0 

- > 

0 

0 

1 

0 

1 

0 

1 

0 

1 

0 

0 

1 


0 

-1 

-1 

-1 

0 

1 


" 1 

1 

2 

1 

0 

0 " 

R 2 Oi? 3 

" 1 

1 

2 

1 

0 

0 " 

0 

0 

1 

0 

1 

0 

-> 

0 

1 

1 

1 

0 

-1 

0 

1 

1 

1 

0 

-1 


0 

0 

1 

0 

1 

0 


" 1 

0 

1 

0 

0 

1 " 

R i R 3 

R 2 R 3 

" 1 

0 

0 

0 

-1 

1 " 

0 

1 

1 

1 

0 

-1 

- > 

0 

1 

0 

1 

-1 

-1 

0 

0 

1 

0 

1 

0 


0 

0 

1 

0 

1 

0 


The left 3x3 matrix part became the identity matrix /, thus A is invertible (since it is row equivalent to / 
), and the inverse matrix A ~ 1 is given by the right 3x3 matrix. Thus we have 


A^ = 


0 -1 1 

1 -1 -1 

0 1 0 


Double check 

Once you obtained the inverse matrix A -1 , then you should check that whether A -1 A is the identity 
matrix. 

If not, it means that you did a computational mistake somewhere. 


O 


Let A be the following 3x3 upper triangular matrix. 


A = 


1 

0 

0 


x y 

1 z 5 

0 1 . 


where cc, 7 /, z are some real numbers. Determine whether the matrix A is invertible or not. If it is invertible, then 
find the inverse matrix A~ 1 . 


Solution. 

We form the augmented matrix 































[A \I\ = 


1 

X 

y 

1 

0 

0 " 

0 

1 

z 

0 

1 

0 

0 

0 

1 

0 

0 

1 


and apply elementary row operations as follows. 


1 

X 

7 

1 

0 

0 " 

R 1 -xR 2 

" 1 

0 

y — xz 

i 

—x 

0 " 

0 

1 

z 

0 

1 

0 

- > 

0 

1 

Z 

0 

1 

0 

0 

0 

1 

0 

0 

1 


0 

0 

1 

0 

0 

1 


Ri-(y-xz)R 3 

R 2 -ZR 3 

’ 1 

0 

0 

1 

—x 

1 

- > 

0 

1 

0 

0 

1 

—z 


0 

0 

1 

0 

0 

1 


We could reduce the matrix A into the identity matrix /. 

Thus, the matrix A is invertible and the right 3x3 matrix is the inverse matrix of A -1 
Hence, 


A - 1 


1 —x xz — y 

0 1 -z 
0 0 1 


o 


For which choice(s) of the constant k is the following matrix invertible? 


A = 


1 1 
1 2 
1 4 


1 

k 

k 2 


(Johns Hopkins University) 


Hint. 

An n X n matrix is invertible if and only if its rank is n . 

The rank of a matrix is the number of nonzero rows of a (reduced) row echelon form matrix that is row 
equivalent to the given matrix. 

Solution. 

We compute the rank of the matrix A . 

Applying elementary row operations, we obtain 


'll 1 " 

R 2 R 1 

'1 1 

1 

Rl—R2 

'1 0 2 — k ' 

12 k 

^3—^1 
- > 

0 1 

k — 1 

R3SR2 
- > 

0 1 k- 1 

.1 4 k 2 _ 


.0 3 k 2 - 1. 


.0 0 k 2 - 3 k+ 2 _ 


The last matrix is in row echelon form. 

Note that A is an invertible matrix if and only if its rank is 3 . 

Therefore the ( 3 , 3 ) -entry of the last matrix must be nonzero: k 2 — 3 k + 2 = [k — 1 ) [k — 2 ) 0 . 





























It follows that the matrix A is invertible for any k except k — 1,2. 

Suppose that M , P are two n X n non-singular matrix. Prove that there is a matrix N such that MN — P. 

Proof. 

As non-singularity and invertibility are equivalent, we know that M has the inverse matrix M -1 . 

Let us think backwards. Suppose that we have MN — P for some matrix N , which we want to find. 
Then multiply M ~ l on the left of the equation MN = P yields N = M ~ l P . 


This is the matrix we are looking for, as MN — M{M 1 P) — P . 

Let A be an n X n matrix satisfying A 2 + C\ A + CqI = O, where Co, C\ are scalars, / is the n X n identity 
matrix, and O is the n X n zero matrix. Prove that if Cq 7^ 0, then the matrix A is invertible (nonsingular). How 
about the converse? Namely, is it true that if Cq = 0, then the matrix A is not invertible? 


Proof. 

If Co 7^ 0 , then the matrix A is invertible 

Suppose first that Co 7 ^ 0. 

Then we have 

A 2 + civ4 = —col 

< v=7 > Ai^A "T Cr/) = —Co/ 


It is in the last step that we needed to assume Cq 7^ 0 . 


Thus, if we put 


then we have proved that 


B — (A + ci/), 

<?o 


AB = I. 

Similarly, one can check that BA = A . Hence B is the inverse matrix of A . 
Namely, 

A - 1 = —(A + al). 

CO 

This proves that when Cq 7^ 0 the matrix A is invertible. 




Is it true that if Co = 0 , then the matrix A is not invertible? 

Next, let us consider the case Co = 0. 

We claim that the matrix A can be invertible even Co = 0. 

For example, if A = / , then A satisfies 

A 2 -A 

(Thus, Ci = — 1 and Co = 0.) 

Since the identity matrix is invertible, the condition Co 
invertible. 

A square matrix A is called idempotent if A 2 — A. Show that a square invertible idempotent matrix is the 
identity matrix. 


= o . 


= 0 does not force the matrix A to be non- 


Proof. 

Let A be an n X n invertible idempotent matrix. 

Since A is invertible, the inverse matrix A ~ 1 of A exists and it satisfies A~ l A — I n , where I n is 
n X n identity matrix. 

Since A is idempotent, we have A 2 — A . 

Multiplying this equality by A ~ 1 from the left, we get A~ l A 2 — A ~ 1 A . Using the fact that 
A ~ 1 A — I n , we obtain^ = I n . 

The proof is completed. 

Related Question. 

Give it a try with the following problems about idempotent matrices. 


Problem. 

(a) Let u be a vector in M” with length 1 . 
Define the matrix P to be P — uu T . 

Prove that P is an idempotent matrix. 


(b) Suppose that u and v be unit vectors in R” such that u and v are orthogonal. 
Let Q = uu T + vv T . 

Prove that Q is an idempotent matrix. 


(c) Prove that each nonzero vector of the form au. + bv for some a,b E R is an eigenvector 
corresponding to the eigenvalue 1 for the matrix Q in part (b). 



o 

Find the inverse matrix of A 


reason. 


1 

1 

2 


0 1 " 
0 0 
1 1 


if it exists. If you think there is no inverse matrix of A, then give a 


(a) Find the inverse matrix if exists. 

We consider the augmented matrix [A \ I] , where / is the 3 X 3 identity matrix, and apply elementary 
row operations as follows. We have 


" 1 

0 

1 

1 

0 

1 

o 

R2—R1 

R3-2R1 

" 1 

0 

1 

1 

0 

0" 

1 

0 

0 

0 

1 

0 

\ 

0 

0 

-1 

-1 

1 

0 


2 

1 

1 

0 

0 

i 


0 

1 

-1 

-2 

0 

1 


1 

0 

1 

1 

0 

0" 

CO 

1 

' 1 

0 

1 

1 

0 

0 " 

0 

1 

-1 

-2 

0 

1 

- > 

0 

1 

-1 

-2 

0 

1 

0 

_1 

0 

-1 

-1 

1 

0 


0 

0 

1 

1 

-1 

0 


' 1 

0 

0 

0 

1 

0 ' 

0 

1 

0 

-1 

-1 

1 

0 

0 

1 

1 

-1 

0 


Therefore, we could reduce the matrix A into the identity matrix (A is row equivalent to I) and thus A is 
invertible, and the inverse matrix is given by the right half of the last matrix. 

Hence the inverse matrix is 


A- 1 


0 10 
-1 -1 1 
1-10 


Find a nonsingular 2x2 matrix^ such that A 3 = A ~B — 3.1 2 . 

Suppose that A is a nonsingular matrix such that A 3 = A 2 B — 3A 2 . 

Since A is nonsingular, it is invertible and thus the inverse matrix A -1 exists. 


Then we have 

A = A ~ 2 A 3 
= A~ 2 (A 2 B — 3A 2 ) 

= A ~ 2 A 2 B - 3A ~ 2 A 2 
= IB -31 
= B-31 
1 1 ' 

.2 3 

To prove that this matrix is nonsingular we calculate the determinant of A . 

(Recall that if the determinant of a matrix is nonzero, then it is invertible, hence nonsingular.) 
The determinant of 2 X 2 matrix A is 




























det(^) = l- 3- l- 2 = 1^0. 


Since the determinant of A is nonzero, we conclude that A is nonsingular. 
Thus, the nonsingular matrix A satisfying A 3 — A 2 B — 3A 2 must be 


A = 


1 1 
2 3 


0 


4 1 
2 6 


. Verify that the matrix 


Find a nonsingular 2x2 matrix A such that A 3 — A 1 B — 3A~ , where B — 

A you obtained is actually a nonsingular matrix. 

(a) Find the inverse matrix if exists. 

We consider the augmented matrix [A \ I], where I is the 3 X 3 identity matrix, and apply 
elementary row operations as follows. We have 


" 1 

0 

1 

1 

0 

0 " 

1 _ 

1 

0 

0 

0 

1 

0 

-* 

2 

1 

1 

0 

0 

1 



10 1 
0 0-1 
0 1-1 


10 0 
-110 
-2 0 1 


i?2 


" 1 

0 

1 

1 

0 

0 " 

CO 

i 

" i 

0 

1 

1 

0 

0 " 

0 

l 

-1 

-2 

0 

1 

0 

1 

-1 

-2 

0 

1 

o 

_1 

0 

-1 

-1 

1 

0 


0 

0 

1 

1 

-1 

0 


CO CO 

1 + 

' 1 

0 

0 

0 

1 

o 

- > 

0 

1 

0 

-1 

-1 

1 


0 

0 

1 

1 

-1 

0 


Therefore, we could reduce the matrix A into the identity matrix ( A is row equivalent to I ) and thus 
A is invertible, and the inverse matrix is given by the right half of the last matrix. 

Hence the inverse matrix is 


A- 1 = 


0 1 

-1 -1 

1 -1 


Find a nonsingular 2x2 matrix A such that A 3 = A 2 B — 3 A 2 . 

Suppose that A is a nonsingular matrix such that A 3 = A 2 B — 3 A 2 . 

Since A is nonsingular, it is invertible and thus the inverse matrix A~ l exists. 


Then we have 


A = A~ 2 A 3 
= A- 2 (A 2 B - 3A 2 ) 

= A~ 2 A 2 B - 3 A~ 2 A 2 
= IB -31 
= B -31 
1 1 ' 

2 3 





























To prove that this matrix is nonsingular we calculate the determinant of A. 

(Recall that if the determinant of a matrix is nonzero, then it is invertible, hence nonsingular.) 
The determinant of 2 X 2 matrix A is 


det(A) = 1- 3 — 1-2 = 1^ 0. 


Since the determinant of A is nonzero, we conclude that A is nonsingular. 
Thus, the nonsingular matrix A satisfying A 3 — A 2 B — 3A 2 must be 


A = 


1 

2 


1 

3 


Determine whether there exists a nonsingular matrix A if A 2 = AB + 2 A, where B is the following matrix. If 
such a nonsingular matrix A exists, find the inverse matrix A~^. 

(a) 


(b) 



Solution. 

Suppose that a nonsingular matrix A satisfying A 2 = AB + 2 A exists. 
Then A is invertible since A is nonsingular, and thus the inverse A 1 exists. 

Multiplying by A on the left, we have 

A =A~ 1 A 2 =A~\AB + 2A) 

= A ~ X AB + 2 A _1 .A 

= B+ 21, 


where I is the 3x3 identity matrix. 

Therefore, if such a nonsingular matrix exists, it must be 

A = B + 27. 


(a) The first case 

Let us consider the case 



-1 

0 

-2 


(* 


In this case, we have from (*) 


1 

-1 

2 











A = B + 21 = 


1 1 -1 
0 10 
12 0 


We still need to check that this matrix is in fact a nonsingular matrix. 

To check the non-singularity and to find the inverse matrix at once, we consider the augmented matrix 
[A | /] and apply elementary row operations. 

We have 



' 1 

1 

-1 

1 

0 

0 " 

\4 | /] = 

0 

1 

0 

0 

1 

0 


1 

2 

0 

0 

0 

1 



1 

1 

-1 

1 

0 

0 

^1 —^2 

1 

0 

-1 

1 

-1 

0 

1 

CO 








R 3 R 2 







-» 

0 

1 

0 

0 

1 

0 

-> 

0 

1 

0 

0 

1 

0 


0 

1 

1 

-1 

0 

1 
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0 

1 

-1 

-1 

1 


" 1 
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0 


0 

-2 

1 " 








R 1 +R 3 















-» 

0 

1 

0 


0 

1 

0 









0 

0 

1 


-1 

-1 

1 









The left part of the last matrix is the identity matrix, and thus the matrix A is invertible and the inverse 
matrix is the right half: 


A- = 


0 -2 1 

0 10 

-1 -1 1 


(b) The second case 


Next let us consider the case 


B = 


-1 

0 

2 


1 -1 
-1 0 
1 -4 


By (*), if a nonsingular matrix A exists, it must be 


A = B + 21 = 


1 

0 

2 


1 

1 

1 


-1 

0 

-2 


We need to determine whether this matrix is actually nonsingular. 

In fact, we prove that this matrix is singular. 

That is, we show that Ax = 0 has a nonzero solution. 

Consider the augmented matrix A | 0] . By Gauss-Jordan elimination, we have 

























[A \0] = 


' 1 

1 

-1 

0 " 

0 

1 

0 

0 

2 

1 

-2 

0 


*3-2/?, 


1 

1 

-1 

0 " 

E 1 —E 2 

P3FP2 

" 1 

0 

-1 

0 " 

0 

1 

0 

0 

-> 

0 

1 

0 

0 

0 

-1 

0 

0 


0 

0 

0 

0 


The last matrix is in reduced row echelon form and it has a zero row. From this, we see that x 3 must be a 
free variable, and the matrix A is singular. 

(The general solution is x 1 = x 3 , x 2 =0. Thus for example, Xi = 1, x 2 = 0, x 3 = 1 is a nonzero 
solution of Ax — 0 .) 

Thus, we conclude that there is no nonsingular matrix A satisfying A 2 = AB + 2 A in this case. 


0 Determine whether there exists a nonsingular matrix A if A 4 = ABA 2 + 2 A 3 , where B is the following 

"-1 1 - 1 " 

matrix. B — 0 —1 0 .If such a nonsingular matrix A exists, find the inverse matrix A~ 1 . 

_ 2 1-4. 

(The Ohio State University, Linear Algebra Final Exam Problem) 


Solution. 

Assume that A is nonsingular. 

Then the inverse matrix A ~ 1 exists and we have 


A =A~ 1 A 4 A “ 2 
= A~ 1 (ABA 2 + 2\A 3 )A~ 2 
= A ~ 4 ABA 2 A - 2 + 2 A ~ 4 A 3 A - 2 
= B+ 21, 


where / is the 3x3 identity matrix. 
Thus we have 


A = B + 21 = 


11-1 

0 10 


2 1 - 2 . 

We compute the determinant of A by the second row cofactor expansion as follows. 


det(/l) 


1 1 1 
0 10 

2 1-2 


1 -1 

2 -2 


= 0 . 


This contradicts the assumption that A is nonsingular. 

Therefore, there is no nonsingular matrix A satisfying the relation A 4 — ABA 2 + 2 A 3 . 






















Let A , B , (7 be the following 3x3 matrices. 


'1 2 3" 


'1 0 1" 


'-i o r 

4 5 6 

,B = 

0 3 0 

,c = 

0 5 6 

1- 

-a 

00 

CO 


.1 0 5. 


3 0 1. 


Then compute and simplify the following expression. 

(A t - B) t + CiB-'C)- 1 . 

(The Ohio State University) 

Solution. 

We first use the properties of transpose matrices and inverse matrices and simplify the expression. 
Note that we have 

(. A t - B) t = ( A t ) t -B t = A- B 

since the double transpose (A T ) T = A and B is a symmetric matrix. 

Also, note that we have 

(. B~ l C)~ l = C _1 (5 _1 ) _1 = C~ X B 

since (l? -1 ) -1 = B. Care must be taken when you distribute the inverse sign in the first equality. 
We needed to switch the order of the product. 

Then we have 


C(B- l C)~ x = CC X B = IB = B, 
where I is the 3 X 3 identity matrix. 

Therefore, the given expression can be simplified into 

(^ T - B) t + C(B~ 1 C)~ 1 = A-B + B = A. 


Hence we have 


(^ T - b) t + c(B~ l cy l 


A = 


2 

5 

8 


3 

6 

9 


Common Mistakes 


Here are two common mistakes. 


The first one is 

(B-'C)- 1 = (.rT'C- 1 (This is wrong!!). 

Note that in general, we have 

(. AB ) _1 = B~ l A~ l (Note the order of products). 

The second common mistake is that 










CBC 1 = CC 1 B — B (The first equality is wrong!!). 

Recall that in general matrix multiplication is not commutative, meaning that 

AB ± BA. 

And surprisingly, if you combine these two mistakes, you get the right answer. 

C(5 : C) 1 = CBC 1 = B. (This is wrong!!). 

However, these mistakes show that you didn’t understand matrix operations including transpose and 
inverse matrices. 


0 Let A be the coefficient matrix of the system of linear equations 


— X\ — 2X2 — 1 
2xi + 3x2 — — 1 . 


(a) Solve the system by finding the inverse matrix A 1 . 

Xi 


(b) Let x = be the solution of the system obtained in part (a). Calculate and simplify 

A 2017 x. 

(The Ohio State University) 


Solution. 

(a) Solve the system by finding the inverse matrix A ~ 1 
The coefficient matrix A of the system is 

A = 


-1 -2 
2 3 


The determinant of the matrix A is det(v4) = ( — 1)(3) — ( — 2)(2) = 1 , and thus A is invertible 

and the inverse matrix can be found by the 2 X 2 inverse matrix formula 


A~ l = 


if A = 

Thus 


a b 
c d 


ad — be 
and det(v4) = ad — be ^ 0 . 


d —b 
—c a 


A ' = 


3 2 

-2 -1 


(Or you could form the augmented matrix [A \ I] and find A .) 


Now the given system of linear equations can be written as 














A 


■*r 


' l ' 

_* 2 _ 


_-i_ 


Multiplying by A 1 on the left, we obtain 


xi 

X2 


= A 


-l 


3 2 

-2 -1 


Thus the solution of the system is X i = 1, X 2 = — 1 , or in the vector form 


"xi" 


' l ' 

_*2_ 


,-i. 


(b) Calculate and simplify^ 2017 x 

is the solution of the system Ax = 
Ax = x. 


Since x 


~Xl 


' l ' 

_X2_ 


,-i. 


1 

-1 


, we see that 


From this, we have 


Ax— AAx — Ax — x 
A 3 x = AA 2 x = Ax = x 
A 4 X — AA 3 X — Ax — x 


Repeating this, we see that 


A 2017 x = x. 


(To be more precise, you can prove that A n x — x for any positive integer ft by mathematical induction 
on ft.) 

Thus, we have 


A 201T x = 


1 

-1 


(a) Let A be a 6 X 6 matrix and suppose that A can be written as A = BC, where B is a 6 X 5 matrix and C is 
a 5 X 6 matrix. Prove that the matrix A cannot be invertible. 

(b) Let A be a 2 X 2 matrix and suppose that A can be written as A — BC, where B is a 2 X 3 matrix and C is 
a 3 X 2 matrix. Can the matrix A be invertible? 


Solution. 

(a) Prove that the matrix A cannot be invertible. 
Since C is a 5 X 6 matrix, the equation 


Cx = 0 


has a nonzero solution x i. 

(There are more variables than equations in the system Cx = 0 .) 




























It follows that we have 


Ax i = BCx i = 50 = 0. 


Since the vector x i is nonzero, the matrix A is a singular matrix, hence A is not invertible. 


(b) Can the matrix A be invertible? 

The answer is yes. For example consider the following 2x3 matrix B and 3x2 matrix C : 


'1 0 

o ' 

, c = 

'1 

o ' 

0 

1 

.0 1 

0 . 


.0 

0 . 





Then we have 


A — BC = 



'1 o' 


10 0 




1 0 



0 1 

= 


0 10 




0 1 


o 

o 



and the matrix A is invertible. 


For a real number a, consider 2x2 matrices A , P, Q satisfying the following five conditions. 

(1) A = aP + (a + 1)Q, (2) P 2 = P, (3) Q 2 = Q, (4) PQ = O, (5) QP = O, where O is the 2 X 2 
zero matrix. Then do the following problems. 

(a) Prove that (P + Q) A = A. 


(b) Suppose a is a positive real number and let A 


a 0 
1 CL A~ 1 


. Then find all matrices P, Q satisfying 


conditions (l)-(5). 

(c) Let n be an integer greater than 1. For any integer k, 2 < k < n, we define the matrix A^ — 

Then calculate and simplify the matrix product A n A n _iA n _2 * * * ^2- 
(Tokyo University Entrance Exam 2007) 


0 

k + 1 


Solution. 

(a) Prove that [P + Q)A = A . 


We have 


(P + Q)A — (P + Q)(aP + (a + 1 )Q) 

— aP ^ + {cl T" 1)P Q A~ clQP A~ {cl A~ l)jQ^ 

— clP A~ {cl + 1)0 T - clO T~ + 1)|Q 
[by (2), (3), (4), (5)] 

= aP + (a + 1)P = A. 


Hence, we obtain 


(P + Q)A — A 

















as required. 

(b) Find all matrices P , Q 


Note that the matrix A 


a 0 
1 ci T - 1 

det(^) = 


is invertible since the determinant 


a 0 
1 u ~\~ 1 


= a(a + 1) ^ 0. 


By part (a), we know {P + Q)A = A . 

Multiplying this by A from the right, we have 

P + Q = I, 


where / is the 2x2 identity matrix. 

Substituting Q — I — P into the equality^ = aP + (a + 1)Q of (1), we have 

A = aP + (a + l)(I-P) 

= (a + 1)7 — P. 

Thus, we have 

P = (a + 1)1 - A 


ci A~ 1 

0 


a 

0 

0 

Cl H - 1 _ 


.1 

Cl -h 1 _ 


1 0 


and thus 


1 0 
0 1 

0 0 
1 1 


1 0 

-1 0 


It is straightforward to check that the matrices 

0 0 
1 1 


p = 


1 0 

-1 0 


and Q — 


satisfy conditions (l)-(5). 

Hence these are the only matrices satisfying conditions (l)-(5). 

(c) Calculate and simplify the matrix product A n A „ \ A n 2 ■■■ A ■> 
In part (2), we showed that for any positive integer k 


Ak — kP + (k + 1 )Q, 






















where 


P = 



0 

0 


and Q = 


0 

1 


0 

1 


(We just applied the result of (b) with a — k.) 


Thus we have 


^4 n —\ • • • A 2 

= (nP + (« + 1 )Q) ((« - 1 )P + nQ)■■■ (2 P + 30) 


= n\P + 


(n + 1)! 
2 


Q 


We used conditions (4) and (5) in the second equality. 
Using the explicit matrices for P and 0, we have 

, (n + 1)! 
n\P + -- ’-Q 


= n\ 


1 o' 

{n + l)\ 

'0 o' 

.-1 0. 

2 

.1 1. 


n\ 


0 


I i U + 1 ) 1 ("+ 1 )! 


-n\ + 


2 J 


n\ 


nl 


0 


I n —1 («+!)! 


Note that in the last step, we computed 


. (n + 1)! , (n + l)n\ 

n\ + --— = —n\ + - --— 


= n!(-l+^±i) 


= n\- 


n — 1 


In conclusion, we have obtained 


^ nA n—1 %%% A 2 — 


n\ 


n\ 0 

I n—l (w+1)! 


Comment. 


Another way to solve (c) is that one first guesses the formula we obtained and prove it by mathematical 
induction. 


G 


Let v be a nonzero vector in R n . Then the dot product v • v = v T v ^ 0. Set a := —^ and define the n X n 
matrix A by A = I — avv T , where / is the n X n identity matrix. Prove that A is a symmetric matrix and 
A A = /. Conclude that the inverse matrix is A -1 = A. 



























Proof. 

A is symmetric 


We first show that the matrix A is symmetric. 

We calculate using properties of transpose 

A t = (/ — avv T ) T by definition of A 

= I T — (avv T ) T 
— I — a(v T ) T v T 
— I — avv T 

— A by definition of A. 


Hence we have A T = A , and thus A is symmetric. 

AA = / and^4 ~ 1 = A 

Next, we prove thatv4^4 = / . 


We compute 

AA = (I - avv T )(I - avv T ) 

= /(/ — avv T ) — avv T (I — avv T ) 

r p T 9 T T 

— I — avv — avv + a vv vv 
— I — 2 avv T + a 2 vv T vv T . 

Note that we have 


T T / T \ T 

vv vv =v(v vjv 



a 


by definition of a ^ 0 


Plugging this relation into (*), we obtain 


AA — I — 2 <xvv T 


9 ^ T 

+ a — vv 1 


= /. 


(* 


Thus we get v4v4 = / . 

This implies that the inverse matrix of A is A itself: A ~ 1 — A . 
0 Consider the system of linear equations 

xi = 2, 

—2xi + x 2 = 3, 

5®i — 4^2 + xs — 2 


(a) Find the coefficient matrix and its inverse matrix. 

(b) Using the inverse matrix, solve the system of linear equations. 
(The Ohio State University) 


Solution. 

(a) Find the coefficient matrix and its inverse matrix. 
The coefficient matrix is 


A := 


1 

-2 1 
5 -4 


0 0 
0 


To find the inverse, we reduce the augmented matrix 


1 

0 

0 

1 

0 

0 " 

-2 

1 

0 

0 

1 

0 

5 

-4 

1 

0 

0 

1 


Using the elementary row operation. 


1 

0 

0 

1 

0 

0 " 

R 2 +2 R 1 
-» 

" 1 

0 

0 

1 

0 

0 " 

-2 

1 

0 

0 

1 

0 

0 

1 

0 

2 

1 

0 

5 

-4 

1 

0 

0 

1 

*3-5*1 

0 

-4 

1 

-5 

0 

1 


" 1 

0 

0 

1 

0 

0 " 

0 

1 

0 

2 

1 

0 

0 

0 

1 

3 

4 

1 


R 3 2 


Now that the left 3 X 3 part became the identity matrix (this form is called the reduced row echelon form), 
the inverse matrix is 


A~ x = 


10 0 
2 10 
3 4 1 


(b) Using the inverse matrix, solve the system of linear equations. 

Using the coefficient matrix^ the given system can be written as the matrix equation 



Xi 


"2" 

A 

*2 

= 

3 


_*3_ 


.2. 


Multiplying it by the inverse matrix A 1 on the left, we get 


Xl 

*2 

= A - 1 

CM CO 

1_ 

1 

CO 

X 


.2. 


'1 0 O' 


"2" 


" 2 " 

2 10 


3 

= 

7 

.3 4 1 . 


_2_ 


.20. 


Therefore the solution of the system is x i = 2 , x 2 = 7,.\'3 = 20 . 
Comment. 



































Once you obtained the inverse matrix A 1 in part (1), it’s better to calculate AA 1 using your matrix. 
If AA -1 is not the identity matrix, you made a calculation mistake somewhere. 

Also, in part (b), after finding the solution of the system, you may want to substitute your answer to the 
original system to see if they are actually the solution. 

These extra (but not so time consuming) computations reduce careless mistakes in a exam. 


Consider the following system of linear equations 


2 x + 3y + z = — 1 
3x + 3y + z = 1 
2 x H - 4 y -T z — —2. 


(a) Find the coefficient matrix A for this system. 

(b) Find the inverse matrix of the coefficient matrix found in (a) 

(c) Solve the system using the inverse matrix A~ 1 . 


Solution. 

(a) Find the coefficient matrix A for this system. 
The system can be written as 


Ax = b, 


where 


A = 


2 3 

3 3 
2 4 


1 

1 

1 


is the coefficient matrix of the system and 


X = 

X 

y 

and b = 

■-i" 

i 


z 


-2 


(b) Find the inverse matrix of the coefficient matrix 

We apply the elementary row operations to the augmented matrix [A \ I] , where / is the 3 X 3 identity 
matrix. 









Ml i} = 


" 2 

3 

1 

1 

0 

0 " 

R 2 R i 

’ 2 

3 

1 

1 

0 

0 " 

3 

3 

1 

0 

1 

0 

R 3 R i 
-> 

1 

0 

0 

-1 

1 

0 

2 

4 

1 

0 

0 

1 


0 

1 

0 

-1 

0 

1 


R i 2 


then R o <->R 3 


R3SR2 
- > 


1 

0 


Now the left 3 X 3 part has become the identity matrix. 

So the matrix A is invertible and the inverse is given by the right 3 x 3 part. 
Hence we obtain 


A = 


-11 0 
-1 0 1 

6 -2 -3 


1 

0 

0 

1 

0 

0 

-1 1 

-1 0 

0 " 
1 

R 3-2 Rl 
-» 

" 1 

0 

0 

1 

0 

0 

-1 

-1 

1 

0 

0 " 
1 

2 

3 

1 

1 0 

0 


0 

3 

1 

3 

-2 

0 

0 

0 

— 

1 1 

0 " 








1 

0 

— 

1 0 

1 








0 

1 

6 -2 

-3 









(c) Solve the system using the inverse matrix^ -1 . 

As noted in (a), the system can be written using matrices as 

Ax — b. 

Multiplying by the inverse A on the left, we have 

A~ 1 Ax = A~ 1 b 
lx — A -1 b 
x — A : b. 


Therefore the solution x of the system is given by 

X = A~ 1 b 

'-1 1 
= -10 
. 6 -2 

' 2 ' 

= -1 • 

-2 


0 

1 

-3 


-1 

1 

-2 


Thus, the solution of the system is 

x = 2 ,y = -l,z = -2. 

^ Consider the matrix 





























A = 


1 2 1 
2 5 4 

1 1 0 


(a) Calculate the inverse matrix A 1 . If you think the matrix A is not invertible, then explain why. 

(b) Are the vectors 


Ai = 

to 

5 A 2 — 

"2" 

5 

, and A 3 — 

1 - 


1 


1 


0 


linearly independent? 

(c) Write the vector b = 


as a linear combination of Ai, A 2 , and A 3 . 


(The Ohio State University, Linear Algebra Exam ) 


Hint. 


For (a), consider the augmented matrix [A |/] and reduce it. 


Note that given vectors are column vectors of the matrix A , 


Use the inverse matrix A 1 to solve a system. 


Solution. 

(a) Calculate the inverse matrix A -1 
We consider the augmented matrix 


1 

2 

1 

1 

0 

0 " 

2 

5 

4 

0 

1 

0 

1 

1 

0 

0 

0 

1 


and reduce this matrix using the elementary row operations as follows. 


1 

2 

1 

1 

0 

0 

R 2 -2Rx 

1 

2 

1 

1 

0 

O 

2 

5 

4 

0 

1 

0 

-» 

0 

1 

2 

-2 

1 

0 

1 

1 

0 

0 

0 

1 

1 

CO 

0 

-1 

-1 

-1 

0 

1 


Ri-2R 2 


R3+R2 


1 

0 

-3 

5 

-2 

0 

Rl+SRs 

1 

0 

0 

-4 

1 

3 

0 

1 

2 

-2 

1 

0 

-» 

0 

1 

0 

4 

-1 

-2 

0 

0 

1 

-3 

1 

1 

R2—2R3 

0 

0 

1 

-3 

1 

1 


Since the left 3 x 3 part of the last matrix is the identity matrix, the inverse matrix of A is 


A~ x = 


-4 1 3 

4 -1 -2 

-3 11 


































(b) Are the Vectors Linearly Independent? 

To check whether A i, A 2 , A 3 are linearly independent, we consider the linear combination 

X 1 A 1 +x 2 A 2 +X 3 A 3 = 0 

and if this equation has only zero solution, the vectors are linearly independent. 


This equation can be written as the matrix equation 


Ax = 0 , 


where x = 


xi 

*2 


U3J 

Since by part (a), the inverse matrix A -1 exists. Thus multiplying by A ~ 1 on the left we get x 
the solution is x = 0 and the vectors A 1 , A 2 , A 3 are linearly independent. 


0 . Thus 


Another Solution of (b) 

Note that A 1 , A 2 , A 3 are columns vectors of the matrix A . We proved in part (a) that A is invertible. We 
know that the column vectors of an invertible matrix are linearly independent. Thus the vectors 
A 1 , A 2 , A 3 are linearly independent. 

(c) Write the vector b as a linear combination of A 1 , A 2 , and A 3 
We want to solve the vector equation 

X 1 A 1 +x 2 A 2 +X 3 A 3 = b. 

This can be written as the matrix equation 


Ax — b. 


Since A is invertible, we have 


x — A -1 b = 


-4 13 

4 -1 -2 

-3 1 



'1" 


' 0 ' 


1 

= 

1 


. 1 . 




Thus x 1 = 0,X2 — 1 , andx3 = — 1 and the linear combination is 


b = 


= A 2 — A 3 . 


®A2x 2 matrix A satisfies tr(A 2 ) = 5 and tr(A) = 3. Find det(A). 


Solution 1 of (a) A 2 x 2 matrix^ satisfies tr (A 2 ) = 5 and tr (A) = 3. Find det (A ). 
Let 














Then we have 



a b 

a 

b 


a 2 + be 

ab + bd 

e d _ 

c 

d 


_ac + cd 

be + d 2 _ 


Since tr (A 2 ) = 5 and tr (A) — 3 , we obtain 

5 = tr (A 2 ) = ( a 2 + be) + {be + d 2 ) — a 2 + 2 be + d 2 and 
3 = tr(v4) — a + d. 


We find the determinant det(v4) — ad — be as follows. 
We have 


det(v4) — ad — be — 

= |(3 2 -5) 


\ ( (A + ^) 2 

= 2 . 


(a 2 + 2Z?c + J 2 )) 


Thus, we obtain det(^4) = 2 . 

Solution 2 of (a) 

Let X i and 2 2 be eigenvalues of A . 

Then we have 

3 = tr(v4) = X 1 + 2 2 and 
5 = tr(^4 2 ) — X\ + X\. 


Here we used two facts. 

The first one is that the trace of a matrix is the sum of all eigenvalues of the matrix. 

The second one is that X 2 is an eigenvalue of A 2 if X is an eigenvalue of A , and these are all the 
eigenvalues of A 2 . 

Since the determinant of A is the product of eigenvalues of A , we have 

det(^4) = X\X<l 

= ^((^1 +^ 2) 2 - (^1 +^ 2 )) 

= f( 3 2 - 5 ) 

= 2 . 


Hence we have det (A ) = 2 . 

Solution 1 of (b) A 2 X 2 matrix has two parallel columns and tr (A) = 5. Find ti(A 2 ). 
Since two columns are parallel, we can write A as 











Then we have 


5 = tr (A) = a + rc. 


We use the formula in Solution 1 of (a) for tr(A 2 ) with b — ra and d = rc , and we compute 

tr(A 2 ) — a 2 + 2 (ra) + (rc) 2 
= (a + rc) 2 
= 5 2 = 25. 


Thus, we find tr(A 2 ) = 25. 

Solution 2 of (b) 

Since two columns are parallel, the matrix A is singular. Hence A has an eigenvalue 0. 

Since the sum of all the eigenvalues is tr(A) = 5 , we see that 0 and 5 are eigenvalues of A . 

It follows that 0 and 25 are eigenvalues of A 2 . Hence 

tr(A 2 ) = 0 + 25 = 25. 

Solution of (c) A 2 X 2 matrix + has det(A) = 5 and positive integer eigenvalues. What is the 
trace of ^4 ? 

The product of eigenvalues of A is the determinant det(^4) = 5 . 

Since eigenvalues are positive integers, it follows that 1 and 5 are eigenvalues of A . 

It follows that 


tr (A) = 1 + 5 = 6. 


Suppose that a real matrix A maps each of the following vectors 



"1" 


"0" 


"o' 

Xl = 

1 

>X 2 = 

1 

>X 3 = 

0 


1 


1 


1 


into the vectors 



"1" 


■-1" 


" 3 " 

Y\ = 

2 

,y2 = 

0 

,ys = 

1 


_0_ 


. 3 . 


_i_ 


respectively. That is, = y i for i = 1, 2, 3. Find the matrix A. 
(Kyoto University Exam) 


Solution 1 

By the condition that Ax \ — y - , we have a matrix equality A [x 1 X 2 X 3 ] = [ 71 + 2 }+] • 
Explicitly we have 



















"l 

0 

O' 


'1 

-1 

3 ' 

A 

1 

1 

0 

= 

2 

0 

1 


1 

1 

1 


0 

3 

1 


To find the matrix A , we find the inverse matrix of 


1 0 0 
1 1 0 
111 


and multiply on the right by it. 


We use Gauss-Jordan elimination to transform the augmented matrix [A |/] into [I\A X ] 
We have 


" 1 

0 

0 

1 

0 

1 

O 

R2—R1 
-» 

" 1 

0 

0 

1 

0 

0 " 

1 

1 

0 

0 

1 

0 

0 

1 

0 

-1 

1 

0 

1 

1 

1 

0 

0 

1 

^3—^1 

0 

1 

1 

-1 

0 

1 


R 3 —R 2 


" 1 

0 

0 

1 

0 

0 " 

0 

1 

0 

-1 

1 

0 

1- 

0 

0 

1 

0 

-1 

1 


Thus the inverse matrix is 


100 

-110 

0-11 


Hence we have 


'1 -1 3 ' 


' 1 0 O' 


'2 -4 3 ' 

2 0 1 


-110 

= 

2-11 

.0 3 1 . 


. 0 - 11 . 


.-3 2 1 . 


Solution 2 


Note that in general A is a matrix whose /-th column is the image of the z-th unit vector e z . 
For given vectors X;, it is easy to see that 


ei = xi — x 2 
e 2 = x 2 - x 3 
e3 = x 3 . 


Thus we have 


Ae 1 = ^ 4 xi — Ax 2 


Yi 



Similarly, we obtain 


Ae 2 — 

'- 4 ' 

-1 

, andv 4 e 3 = 

' 3 ' 

1 


2 


1 


Therefore we have 

































A = [AeiAe2Ae3 ] 


2-4 3 
2-11 
-3 2 1 


Of course, this agrees with the answer of Solution 1. 

Comment. 

This is a typical problem in linear algebra, but if you noticed the second approach then you could save time 
in the exam. 


Once you obtained the matrix A , you can check whether you computed right by calculating Ax / with 
your A and see if it is equal to the vector y t . 


Let A and B are n X n matrices with real entries. Assume that A + B is invertible. Then show that 

A(A + B)~ 1 B = B(A + B)~ x A. 

(University of California, Berkeley Qualifying Exam) 


Proof. 

Let P =A + B . Then 2? = P - A . 

Using these, we express the given expression in terms of only A and P . 

On one hand, we have 

A(A + B) l B = AP _1 (P -A) = AP X P -AP~ 1 A=A- AP _1 A. 

On the other hand we have 

B{A + BY 1 A = {P - A)PA = PP ~ X A -AP~ 1 A=A- AP ^A. 

Thus these are equal. 

This completes the proof. 

O Let A be an n X n invertible matrix. Prove that the inverse matrix of A is uniques. 


Hint. 

That the inverse matrix of A is unique means that there is only one inverse matrix of A . 

(That’s why we say “the” inverse matrix of A and denote it by A -1 .) 

So to prove the uniqueness, suppose that you have two inverse matrices B and C and show that in fact 

B = C. 

Recall that B is the inverse matrix if it satisfies 

AB = BA = /, 


where / is the identity matrix. 







Proof. 

Suppose that there are two inverse matrices B and C of the matrix A . Then they satisfy 

AB = BA = I (*) 

and 

AC = CA= I. (**) 

To show that the uniqueness of the inverse matrix, we show that B = C as follows. Let / be the n X n 
identity matrix. 

We have 


BI 


B(AC) 

by (**) 

(BA)C 

by the associativity 

1C 

by (*) 

c. 



Thus, we must have B — C , and there is only one inverse matrix of A . 

0 Let u and v be vectors in M n , and let I be the n X n identity matrix. Suppose that the inner product of u and v 
satisfies 

v T u ^ -1. 

Define the matrix 

A — I + uv T . 

Prove that A is invertible and the inverse matrix is given by the formula 

A~ 1 — I — auv T , 

where 

1 

a — -——. 

1 +v T u 

This formula is called the Sherman-Woodberry formula. 


Proof. 

Let us put 

B — I — r/uv T , 

the matrix given by the Sherman-Woodberry formula. 

We compute AB and BA and show that they are equal to the identity matrix /. 


Let us first compute the matrix product AB . We have 




AB = (/ + uv T )(/ — auv T ) 

rp rp rp m 

— I — auv + uv — auv uv 

= /+(! — a)uv T — miv T uv T (*) 


By using the defining formula for a 


^ , T , we have 
l+v i u 


a{ 1 + v T u) = 1, 


and thus 


1 - 


a — aw 


T 


u. 




r | 1 r | 1 r | 1 r | 1 

Note that the third term in (*) — cmv uv contains v u in the middle, and v u is just a number. Thus 
we can factor out this number and get 

—auv T uv T = —au(v T u)v T 

= —a(v T u)uv T (***) 


Inserting (**) and (***) into (*), it follows that we have 

AB — I + (<tv t u)uv t — a(v T u)uv T 
= /. 


Thus we have proved AB — I . 


Now we compute BA . We have 

BA = (/ — auv T )(/ + uv T ) 

rp rp rp rp 

= / + uv — a uv — auv uv 
= /+(! — a)uv T — auv T uv T 


and this is exactly the expression (*), hence BA — AB — I 


Therefore, we conclude that the matrix A is invertible and the inverse matrix is B . Hence 

A -1 — I — auv T 

and we have proved the Sherman-Woodberry formula. 

Comment. 

The invertible matrix theorem says that once we have AB = / , then we have automatically BA — I and 
the inverse matrix of A is B , that is, A 1 — B . 

So in the above proof, after proving AB — I , you may conclude that A is invertible and A ~ l — B . 



o 


Let A be a singular 2x2 matrix such that tr(A) 7 ^ — 1 and let / be the 2 X 2 identity matrix. Then prove that 
the inverse matrix of the matrix / + A is given by the following formula: 


(I + A)- 1 


I- 


1 + tr(j4) 


-A. 


Using the formula, calculate the inverse matrix of 


2 

1 


1 

2 


Proof. 


We have 


{I + A)(l- 


A =/- 


-AAA - —— A 2 


1 + tr(^) ) 1 + tr(^) 1 + tr(^) 

=/ -rrUy ( ' 4 “ (1+trM)J+ ' 42) 

='-taaa)^ 2 -^A- 

The Cayley-Hamilton theorem for 2 X 2 matrices yields that 

A 2 — tr (A)A + det (A)I = O. 


Since A is singular, we have det (A) = 0 . 
Hence it follows that we have 


A 2 - tr (A)A = O, 


and we obtain from (*) that 


(I + A) /- 


1 + tr(^4) 


A =/. 


Similarly, 


I- 


A (I + A) = I. 


1 + tr(yt) 

Therefore, we conclude that the inverse matrix of I+A is given by the formula 

1 


(I+A)- 1 =1 


Find the inverse matrix of 


2 1 
1 2 


1 + tr(^) 

using the formula 


-A. 


Now let us find the inverse matrix of 


We first write 


2 1 
1 2 


using the formula. 


2 1 
1 2 


— I + A , 


(*) 





















where 


A = 


1 1 
1 1 


Then A is a singular matrix with tr(A ) — 2. 
The formula yields that 


2 1 
1 2 


-i -l 


= (/ + A) 


-1 


= /- A 

3 


2 -1 

-1 2 


Related Question. 


There is a similar formula for inverse matrices of certain n X n matrices, called Sherman-Woodberry 
formula. 

See the post i 

Sherman-Woodbery Formula for the Inverse Matrix 

for the statement of the Sherman-Woodberry formula and its proof. 


©Let A = 
(a) A n = 


a 0 

0 b 
a n 0 
0 b n 

(b) Let B — S^ 1 AS, where S be an invertible 2x2 matrix. Show that B" — S L A n S for any n € N 


. Show that 

for any n G N. 


Hint. 

Use mathematical induction. 

Proof. 

.. a n 0 

(1) We prove A n = 

L 0 If 

The base case n — 1 is true by definition. 


by induction on n. 


Suppose that A ' 


a k 0 

0 ^ 

A k+l = AA k = 


. Then we have 


a 0 


V 

o' 


.0 b. 


. 0 

a*. 



~k+l 


o 

zA +1 


Here we used the induction hypothesis in the second equality. 
Hence the inductive step holds. This completes the proof. 
























(2) We show that B n — S l A n S by induction on n . 
When n — 1 , this is just the definition of B . 

For induction step, assume that B k — S~ 1 A k S. 
Then we have 


B k+1 — BB k = (5“ 1 ^)(5“ 1 ^^) = S~ 1 AA k S = S~ 1 A k ~ hl S, 

where we used the induction hypothesis in the second equality and the third equality follows by canceling 
SS _1 = 1 2 in the middle. 

Thus the inductive step holds, and this competes the proof. 

Let A be an n X n invertible matrix. Then prove the transpose AA is also invertible and that the inverse matrix of 
the transpose A T is the transpose of the inverse matrix A~ 1 . Namely, show that (A T ) _1 = (^4 -1 ) T . 


Proof. 

By definition of inverse matrices, if there is an n X n matrix B such that 

A t B = / and BA T = /, 

where / is the n X n identity matrix, then A T is invertible and its inverse is B , that is, B — (A T ) -1 

We claim that we can take (A ~ 1 ) T for this B . 

In fact, we have 

A t (A~ 1 ) t = (A^Af =/ T =/. 

Here in the first equality, we used the fact about transpose matrices that 

(CD) t =d t c t 

for any matrices C, D such that the matrix product CD is well-defined. 

Similarly, we have 

(^ -1 )L4 t = (AA^f =/ T =/. 

This proves the transpose A T is invertible and that (A _1 ) T is the inverse matrix of A T , that is, 

(A'r^iA-y 

as required. 


A square matrix A is called nilpotent if there exists a positive integer k such that A k — O, where O is the zero 
matrix. 

(a) If A is a nilpotent n X n matrix and B is an n X n matrix such that AB — BA. Show that the product AB 
is nilpotent. 

(b) Let P be an invertible n X n matrix and let N be a nilpotent n X n matrix. Is the product PN nilpotent? If 
so, prove it. If not, give a counterexample. 



Hint. 

For (b), the statement is false. Try to find a counter example. 

A typical nilpotent matrix is an upper triangular matrix whose diagonal entries are all zero. 

Proof. 

(a) Show that AB is nilpotent 

Since A is nilpotent, there exists a positive integer k such that A k — O. Then we have 

(AB) k = (AB)(AB) ■ ■ ■ (AB) =A k B k = 0B k = 0. 

Here in the second equality, we used the assumption that AB — BA . 

Thus we have (AB) k — O , hence the product matrix AB is nilpotent. 


(b) Is PN nilpotent? 

In general, the product PN of an invertible matrix P and a nilpotent matrix N is not nilpotent. 
Here is a counterexample. 

Let 


'1 

0 

o' 


'0 

1 

1 

1 

1 

0 

and TV = 

0 

0 

1 

.0 

0 

1 . 


.0 

0 

0 . 


Then the matrix P is invertible since det (P) — 1. 

(Note that P is a lower triangular matrix. So the determinant is the product of diagonal entries.) 


Also, it is easy to see by direct computation that N 3 


O , hence N is nilpotent. Indeed, 


N 2 


0 0 1 
0 0 0 
0 0 0 


and 



'0 0 1' 


'o i r 

N 3 =N 2 N = 

0 0 0 


0 0 1 


o 

o 

o 


o 

o 

o 


Now the product PN is 


PN = 


0 

0 

0 


1 

1 

0 


1 

2 

0 


We show that PN is not nilpotent. 
We have 














(PN) 2 


0 12 
0 12 
0 0 0 


(PN) 3 = (PN) 2 (PN) = 

"0 1 2' 

0 12 


"o i r 

0 12 

— 

"0 1 2' 

0 12 


O 

O 

o 


o 

o 

o 


o 

o 

o 


This calculation shows that 


(PN) k 


0 1 
0 1 
0 0 


2 

2 

0 


7 ^ O for all k > 2. 


Thus PN is not nilpotent. In conclusion, the product PN of the invertible matrix P and the nilpotent 
matrix N is not nilpotent. 


(a) Show that if A is invertible, then A is nonsingular. 

(b) Let A, B, C be n X n matrices such that AB — C. Prove that if either A or B is singular, then so is C . 

(c) Show that if A is nonsingular, then A is invertible. 


Definition (Nonsingular Matrix) 

An ft X n matrix A is called nonsingular if the equation Ax — 0 has only the zero solution x = 0 . 

For basic properties of a nonsingular matrix, see the problem Properties of nonsingular and singular 
matrices. 

The result of this problem will be used in the proof below. 

Proof. 

(a) If A is invertible, then A is nonsingular 

Suppose that A is invertible. This means that we have the inverse matrix A -1 of A . 

Consider the equation Ax = 0 . We show that this equation has only zero solution. 

Multiplying it by A ~ 1 on the left, we obtain 

A~ 1 Ax=A~ 1 0 
^ x = 0 


Hence A is nonsingular. 

(b) If either^ or B is singular, then so is C 
Suppose first that the matrix B is singular. 

Then there exists nonzero vector b 0 such that 5b = 0 . Then we have C b = ABb — A0 = 0 
Since b is a nonzero vector, the matrix C is singular. 













Next, we assume that B is nonsingular and A is singular. 

Since A is singular, there exists nonzero vector yo such thatv4y 0 = 0 . 

Then consider the equation Bx — y 0 has a unique solution x o by part (c) of Properties of nonsingular 
and singular matrices. 

The vector Xo is nonzero because y 0 is nonzero. 

Then we have 


Cx 0 = ABxq = Ay 0 = 0. 

Therefore Cx — 0 has nonzero solution x o, hence C is singular. 

(c) If A is nonsingular, then A is invertible 

Suppose that A is nonsingular. Let e z be the n -dimensional vector whose entries are all 0 but 1 in the ith 
place. 

Then the equations Ax — e z has a unique solution x z - for i — 1,..., n . 

Create a matrix B whose z-th column vector is X/, namely B = [x 1 X 2 • • • x w ] . 

Then we have AB — [e 1 e 2 • • • ] = I n • 

Hence B is the right inverse of A . 

Note that the identity matrix I n is nonsingular. Thus by part (c) of Properties of nonsingular and singular 
matrices, the matrix B must be nonsingular as well. 

We repeat the above argument using B instead of A . Then there exist a matrix C such that BC — I n . We 
claim that C — A . 

To see this, multiply AB — I n by C on the right, we get ABC — C . 

Since BC — I n , we get A — C . 

In summary, we obtained AB — I n and BA — I n . Thus A is invertible with the inverse B . 

A square matrix A is called nilpotent if some power of A is the zero matrix. Namely, A is nilpotent if there exists 
a positive integer k such that A k — O, where O is the zero matrix. Suppose that A is a nilpotent matrix and let B 
be an invertible matrix of the same size as A. Is the matrix B — A invertible? If so prove it. Otherwise, give a 
counterexample. 


Solution. 

We claim that the matrix B — A is not necessarily invertible. 
Consider the matrix 


A = 


0 

0 



This matrix is nilpotent as we have 





1 

1—1 

1 

o 


1 

o 

1 

I— 1 
_1 


1 

o 

o 
_1 

1 

o 

o 
_1 


1 

o 

o 

1_ 


1 

o 

o 

1_ 


Also consider the matrix 


B = 


1 

1 


0 

1 


Since the determinant of the matrix B is 1, it is invertible. 


So the matrix A and B satisfy the assumption of the problem. 
However the matrix 


is not invertible as its determinant is 0. 

Hence we found a counterexample. 

Related Question. 

Here is another problem about a nilpotent matrix. 


Problem. 

Let A be an n X n nilpotent matrix. Then prove that / — A, I + A are both nonsingular matrices, 
where / is the n X n identity matrix. 


Let A be an n X n matrix. The (z, j) cofactor Cij of A is defined to be — ( — 1)^ det (M^), where M^j 
is the (z, j) minor matrix obtained from A removing the z-th row and j- th column. Then consider the n X 77- 
matrix C — (Cij), and define the n X n matrix Adj(A) = C T . The matrix Adj(A) is called the adjoint 
matrix of A. When A is invertible, then its inverse can be obtained by the formula 

A~ l = —L-AdjM). 
det (A) JV ’ 


For each of the following matrices, determine whether it is invertible, and if so, then find the invertible matrix 
using the above formula. 


'1 

(a) A — 0 

0 

"1 

(b) B — 0 

3 


5 2" 

-1 2 
0 1 
0 2 ] 

1 4 • 
0 1 


Solution. 


(a) The Inverse Matrix of A . 


Since A is an upper triangular matrix, the determinant of A is the product of diagonal entries. 
Thus we have det (A) — —1^0, and hence A is invertible. 

















To find the inverse using the formula, we first determine the cofactors C,y of A . 
We have 


Cn = 

C21 — 
C31 


1 2 
0 1 

5 2 
0 1 

5 2 

-1 2 


= - 1 , C12 = - 

= -5, C 22 = 

= 12, C 32 = ■ 


0 2 
0 1 


1 2 
0 1 

1 2 
0 2 


— 0, C13 — 

= 1, C 23 = - 

= -2, C 33 = 


0 -1 

0 0 

1 5 

0 0 

1 5 

0 -1 


The the adjoint matrix of A is 


Adj(A) = C T = 


-1 -5 12 

0 1-2 
0 0-1 


Using the formula, we obtain the inverse matrix 


A- 1 = 


det(A) 


Adj (A) = 


1 5 -12 

0-12 

0 0 1 


(b) The Inverse Matrix of B. 

To check the invertibility of the matrix B , we compute the determinant of B. 
The second column cofactor expansion yields that 


det(5) = 


1 2 
3 1 


= -5^0. 


So the matrix B is invertible. 


Now the cofactors C,y of B are 


Cn = 

C 21 = - 

C31 


1 4 
0 1 


0 2 
0 1 


0 2 
1 4 


= 1, C 12 = - 

= 0, C 22 = 

—2, C32 


0 4 
3 1 

1 2 
3 1 


= 12. Ci 3 = 


0 1 
3 0 


= -5, C 23 = - 


1 2 
0 4 


= — 4 , C33 


1 0 
3 0 

1 0 
0 1 


Hence the adjoint matrix of B is 


Adj (5) = C T = 


1 0 -2 
12 -5 -4 

-3 0 1 


It follows from the formula that the inverse matrix of B is 















































B~ l 


1 

det (B) 


Adj (B) = j 


-1 

-12 

3 


0 

5 

0 


2 

4 

-1 


Let A be a real symmetric matrix whose diagonal entries are all positive real numbers. Is it true that the all of the 
diagonal entries of the inverse matrix A~ 1 are also positive? If so, prove it. Otherwise, give a counterexample. 

Solution. 

The statement is in general false. We give a counterexample. 

Let us consider the following 2x2 matrix: 


The matrix A satisfies the required conditions, that is, A is symmetric and its diagonal entries are 
positive. 

The determinant det(A) = (1)(1) — (2) (2) = —3 and the inverse of A is given by 

-1/3 2/3 " 

2/3 —1/3 

by the formula for the inverse matrix for 2 X 2 matrices. 

This shows that the diagonal entries of the inverse matrix A~ l are negative. 





Let A be an n X n nonsingular matrix with integer entries. Prove that the inverse matrix A 1 
integer entries if and only if det (A) — =bl. 


contains only 



0 If B is a square matrix whose entries are integers, then the determinant of B is an integer. 


The inverse matrix of A can be computed by the formula 


A- 1 = -— Adjf4) 

det(^) JV ’ 


Proof. 

Let / be the n X n identity matrix. 

( is an integer matrix, thendet(^4) = ±1 

Suppose that every entry of the inverse matrix^ -1 is an integer. 
It follows that det (A) and det (A -1 ) are both integers. 














Since we have 


det(^4)det(^4 x ) = det (AA X ) = det(/) = 1, 


we 


must have det (A) = ±1. 


( *<= ): If det (/I) = ±1, then A 1 is an integer matrix 
Suppose that det(v4) = =bl. The inverse matrix ofv4 is given by the formula 


A~ x = 


det (A) 


Adj (A), 


where Adj (A) is the adjoint matrix of A . 
Thus, we have 


A~ l = ±Adj(A). 

Note that each entry of Adj (A) is a cofactor of A , which is an integer. 

(Recall that a cofactor is of the form ± det {My ), where My is the (z j) -minor matrix of A , hence 
entries of My are integers.) 

Therefore, the inverse matrix A ~ l contains only integer entries. 



^ Subspaces in R n 


Definition 


Let A be an m X n real matrix. 

A subset W in M n is called a subspace if W is a vector space in M n . 

Q The null space J\f(A) of A is defined by 

AT (A) — {x £ | Ax = 0 m }. 

The range 71 (A) of the matrix A is 

ft(A) = {y £ | y = Ax for some x £ M n }. 

The column space of A is the subspace of A m spanned by the columns vectors of A. 
The row space of A is the subspace of A n spanned by the rows vectors of A. 


Summary 


Let A be an m X n real matrix. 

(Subspace Criteria) A subset W in M n is a subspace of M n if and only if the following three 
condisions are met. 

(a) The zero vector 0 £ M n is in W. 

(b) If x, y £ W, then x + y £ W. 

(c) If x £ W and C £ R, then cx £ W. 

The nullspace of A is a subspace in R n . 

The range of A is a subspace in R m . 

Q The range of A is the column space of A. 

If the columns of A are linearly independent, then J\f (A) — {0}. 


Problems 

Each of the following sets are not a subspace of the specified vector space. For each set, give a reason why it is not 
a subspace. 

( 1 ) 


in the vector space R 3 . 

( 2 ) 


f 

Xi 


\ 

1 


e M 3 

IV 

o 

l 

_Z3_ 


> 




















< 

XI 

X2 

e M 3 

\ 

x\ — 4^2 + 5^3 — 2 > 


,X3_ 


> 


in the vector space M 3 . 

(3) 


S 3 = { 


X 

y J 


y = x 


in the vector space ] 


Solution. 


Recall the following subspace criteria. 

A subset FT of a vector space V over the scalar field K is a subspace of V if and only if the following 
three criteria are met. 


The subset W contains the zero vector of V. 


If u, v E W, then u + v 6 W. 


If u E W and a E K, then au 6 W. 


Thus, to prove a subset W is not a subspace, we just need to find a counterexample of any of the three 
criteria. 


Solution (1). Si = {x G R 3 I x 1 > 0} 

The subset S t does not satisfy condition 3. For example, consider the vector 


x = 


1 1 1 

0 

0 


Then since = 1 > 0, the vector x E S v Then consider the scalar product of x and the scalar -1. 
Then we have 


(-!)■* = 


0 

0 


and the first entry is -1, hence -x is not in S ] . Thus does not satisfy condition 3 and it is not a 
subspace of R 3 . 














(You can check that conditions 1, 2 are met.) 


Solution (2). = {x G R 3 | x 1 - 4x 2 + 5x 3 = 2} 

The zero vector of the vector space R is 


r o n 


o « 


o 

0 


Since the zero vector 0 does not satisfy the defining relation x 1 - 4x 2 + 5x 3 = 2, it is not in S 2 . Hence 
condition 1 is not met, hence S 2 is not a subspace of R 3 . 

(You can check that conditions 2, 3 are not met as well.) 


Solution (3). S 2 = {x E R 2 | y = x 2 } 
Consider vectors 


1 1 

1 


and 


r-i 

i 


These are vectors in S 2 since both vectors satisfy the defining relation y = x 2 
However, their sum 


■1 ■ 


■-1 ■ 


■o ■ 

1 

+ 

1 

— 

1 


2 

is not in S 2 since 1^0. 

Hence condition 2 is not met, and thus S 3 is not a subspace of R . 
(You can check that condition 1 is fulfilled yet condition 3 is not met.) 


An n X n matrix A is called orthogonal if A T A = I. 

Let V be the vector space of all real 2x2 matrices. Consider the subset 

W := {A E V | A is an orthogonal matrix}. 

Prove or disprove that W is a subspace of V. 


Solution. 

We claim that W is not a subspace of V . 

One way to see that W is not a subspace of V is to note that the zero vector O in V, which is the n X n 
zero matrix, is not in W as we have 0 T O = 0^1. 
















Thus, W is not a subspace of V . 

Another approach 

You may also show that scalar multiplication (or addition) is not closed in W . 

For example, the identity matrix / is orthogonal as / T / = / , and thus / is an element in W. 
However, the scalar product 2/ is not orthogonal since 

(2/) T (2/) =4/^/. 


^ Let A be an m X n matrix. Let Af(A) be the null space of A. Suppose that u E AT (A) and v E Af(A). Let 
w = 3u — 5v. Then find Aw. 


Hint. 

Recall that the null space of an m X n matrix A is a subspace of J$L n defined by 

Af(A) = {x E R n | Ax = 0 m }. 

Here 0 m is the m -dimensional zero vector in . 


Solution. 

Since u, v E J\f(A) , we have 


Am — 0 m and Av — 0 m , 


where 0 m is the m -dimensional zero vector in . 

Now using the properties of the matrix multiplication, we have 

Aw — ^4(3u — 5v) 

= ^4(3u) + A(— 5v) 

= 3Au — 5 Av 

— 30 jfi — 0 jfi . 


Therefore we obtained 


Aw — 0 m E . 

Remark. 

Note that a map T : R” R m defined by T (x) = Ax , where A is an m X n matrix is a linear 
transformation. 

That is the map T satisfies: 

Q T (u + v) = T (u) + T (v) for any u,vGR”, and 
Q T (cv) = cT (v) for any v E R n and c E R. 



From this point of view, the above problem can be classified into a problem of linear transformation. 


Let W be the subset of the 3-dimensional vector space R 3 defined by 


w= \ 


~ Xi 

X2 

e M 3 

2x\X2 — X3 > 


1 

_^3_ 


> 


(a) Which of the following vectors are in the subset W? Choose all vectors that belong to W. 



" 0 " 


"1" 


"3" 


O' 

.0. 


" 1 O A 


" 1 " 

(1) 

0 

. 0 . 

(2) 

2 

_2 

(3) 

0 

.0. 

(4) 

(5) 

1 A 4 

1 2 4 

(6) 

-1 

_ —2 


(b) Determine whether W is a subspace of R 3 or not. 


Solution. 

Problem 1 (a) Which of the following vectors are in the subset W ? 

First of all, each element in W is a 3-dimensional vector. Thus (4), (5) can not be elements of W . 

'O' 

, thenxi = 0,x 2 = 0 ,X 3 = 0 and these satisfy the defining relation 2x\X2 = X 3 . Thus 


If x = 


is in W . 


If x = 


, then we have x 1 = 1, x 2 = 2, x 3 = 2, and thus 2x ix 2 — 4 but x 3 = 2 . Hence 


2 xiX 2 7 ^ X 3 , and 


is not in W. For (3), we have 2x iX 2 = 0 = x 3 , and thus 


is in W . For 


1 

-1 

-2 


is in W . 


( 6 ), we have 2 xiX 2 = —2 = X 3 , and thus 
In summary, the vectors (1), (3), ( 6 ) are in W . 

o 

Problem 1 (b) Determine whether W is a subspace of R or not. 
We claim that W is not a subspace. For example, consider the vectors 


" 3 " 

0 

and 

■ 1" 

-1 

. 0 . 


.-2. 


In part (a), we showed that these vectors are in W . 
However, the sum 


'3' 


■ 1' 


■ 4' 

0 

+ 

-1 

= 

-1 

.0. 


.-2. 


.-2. 









































is not in W . 

In fact, ifxi = 4,X2 = — 1,X3 = —2, then 2x\X2 = —8 7 ^ — 2 = X3 . 

Thus the sum does not satisfy the defining relation for W , hence not in W . 

Therefore, the subset W is not closed under addition, hence W is not a subspace of R 3 . 

Solution of Problem 2 

We prove that IF is a subspace of R 3 . To do this, we check the following subspace criteria. 


Subspace Criteria 

(a) The zero vector 0 G R 3 is in W . 

(b) If x, y G W , then x + y G W . 

(c) If x G W and cGR, then cx G W . 


Consider the zero vector 0 


0 

0 

0 


€ M 3 . Then x i 


0 , x 2 = 0 , x 3 = 0 satisfy the defining relations 


x 1 = 3 x 2 and x 3 = 0. Thus the zero vector 0 is in W, hence condition (a) is satisfied. 



Xi 


~y 1" 

To check condition (b), let us take x = 

x 2 

and y = 

72 


_*3_ 


.73. 


sum x + y is in IF 
Since x G IF , we have 


from IF . Then we want to show that the 


xi = 3x2 andx3 = 0. 


Also, since y G IF , we have 


y 1 = 3 y 2 and 73 = 0. 


From these, we see that 

xi +yi = 3 x 2 + 3 y 2 = 3 (x 2 +72) andx 3 +73 = 0. 


Or simply we have 


xi +71 = 3 (x 2 +72) andx 3 +73 = 0. 


Note that this implies that the sum 


x + y 


xi +71 

X 2 +72 
_*3 +F3_ 


satisfies the defining relation, and the sum is in IF . Thus condition (b) is satisfied. 










Finally, to prove condition (c), take x = 

product cx G W . 

Since xG W , we have 


xi 

*2 


from W and let cGK. We need to show that the scalar 


*3 


xi — 3x 2 andx 3 = 0. 


Multiplying these by c, we have 

cxi = 3(cx 2 ) and cx 3 = 0. 


These equalities tells that 
the scalar product 


cx = 


CX 1 
CX 2 

\_CX 3 J 


satisfies the defining relation and hence it is in W . Thus condition (c) is met as well. 
Since we have checked all subspace criteria, W is a subspace of R 3 . 




Let A — 


12 1 
3 6 4 


and let 


"-3" 


'-2" 



1 

, b = 

1 

5 C 

1" 





_1_ 

_ 1 _ 


. 0 _ 




For each of the vectors a, b, c, determine whether the vector is in the null space AT (A). Do the same for the 
range 11(A). 


Solution of (a) 

Let us first recall the definitions of the null space and range. The null space of the 2 X 3 matrix A is 
defined by 

A f(A) = {xg1 3 Mx = 0} 


and the range of A is defined by 

7 Z(A) — {y E M 2 | Ax = y for some x E M 3 }. 

Note that the null space is a subspace of M 3 and the range is a subspace of R 2 . 


Consider first the vector a = 

Also, since we have 


-3 

1 

1 


. Since the vector a is three-dimensional, it cannot be in the range. 



■-3' 


12 1 




0 



1 

= 


3 6 4 




1 


. 1 . 



o, 























the vector a is not in the null space of A . 


Next, we consider the vector b = 

range 1Z(A). 

Since we have 


-2 

1 

0 


. Again, this is three-dimensional vector, hence b is not in the 


A b = 


we see that the vector b is in the null space Af(A) . 



'-2" 


12 1 




0 



1 

= 


.3 6 4. 




.0. 



. 0 . 




= 0 , 


Since it is two-dimensional, the vector c is not in the null space 


Finally, we consider the vector c = 

M(A). 

Let us determine whether c is in the range 71(A) . To do this, we check whether the system Ax = c is 
consistent or not. The augmented matrix of the system is 


[A | c] = 

Applying the elementary row operations, we have 


1 

2 

1 

1 

3 

6 

4 

1 


1 

2 

1 

1 

R 2 -3Ri 

l 

2 

1 

1 

Rl—R-2 

1 

2 

0 

3 " 

1 - 

co 

6 

4 

1 

- > 

0 

0 

1 

-2 

- * 

0 

0 

1 

-2 


and thus it has the general solution 


x i — — 2x2 + 3 
x 3 = -2. 


Therefore, the vector c is in the range 7Z(A) . 

" 1 " 

(For example, if xo = 1 , then^4xo = c .) 

_ — 2 _ 

In summary, the vector a is neither in J\f {A) or in 7Z (A) . The vector b is in Af (A) but not in 7Z(A) , 
The vector c is not in Af {A) but in 71(A) . 

Solution of (b) 

We first describe the null space Af(B) of the matrix B . 
x\ 

x 2 is in the null space Af (B) if and only if Bx — 0 . 

_*3_ 

The augmented matrix of Bx = 0 is 


The vector x = 






























1 

1 

2 

1- 

O 

1- 

1 

to 

-2 

-4 

-1 

O 


Applying the elementary row operations, we have 


1 

1 

2 

0 

R 2 + 2 i? 1 
-> 

1 

1 

2 

0 

-2 

-2 

-4 

0 


0 

0 

0 

0 


and the general solution is 


Therefore, the vector x 
Thus, 


xi 

*2 

,*3j 


Xi = —x 2 - 2 x 3 . 

is in the null space if and only if x \ = —x 2 — 2 x 3 . 


N{B) = {x = 


xi 

X 2 

l_*3j 


G R 3 | xi = —x 2 — 2 x 3 } 


= {x e R 3 | x = 


x 2 — 2 x j 
x 2 
X3 


{x G R 3 | x = x 2 

'-1" 

1 

+ x 3 

1 

O 1 

to 


0 


_ 1 _ 


Span 


r 



1 

1 

to 

\ 

j 

1 

5 

0 

> 

l 

. 0 . 


. 1 . 

J 


Thus, the set 



is a spanning set of the null space J\f{B) . 

Let us verify that S is a linearly independent set. Suppose that we have 


Cl 

1 

+ C 2 

'- 2 ' 

0 


0 


1 


for some scalars C\ , C2 . Then we have 


" -Cl - 2 c 2 " 


'o' 

Cl 

= 

0 

C2 


. 0 . 


and thus C\ — C2 — 0 and S is linearly independent. 

Hence S is a linearly independent spanning set, and thus S is a basis of the null space M (B) . 







































Let W be the subset of M 3 defined by 


w= < 

X — 

"®l" 

X2 

e M 3 

\ 

X\ — 3x2 and £3 = 0 > 


< 

_^3_ 


> 


Determine whether the subset W is a subspace of M 3 or not. 


Solution. 

Problem 1 (a) Which of the following vectors are in the subset W ? 

First of all, each element in W is a 3 -dimensional vector. Thus ( 4 ), ( 5 ) can not be elements of W . 

'O' 

, thenxi = 0 ,x2 = 0,X3 = 0 and these satisfy the defining relation 2 x 1X2 = X3. Thus 


If x = 


is in W . 


If x = 


, then we have x 1 = 1, x 2 = 2, x 3 = 2, and thus 2x \X 2 = 4 but x 3 = 2 . Hence 

2x]X2 7 ^ X 3 , and 


is not in W . For ( 3 ), we have 2 x iX 2 = 0 = x 3 , and thus 

1 


is in W . For 


is in W . 


(6), we have 2 xiX 2 = —2 = X3 , and thus 
In summary, the vectors ( 1 ), ( 3 ), (6) are in W . 

o 

Problem 1 (b) Determine whether FT is a subspace of M or not. 
We claim that W is not a subspace. For example, consider the vectors 


"3" 

0 

and 

■ 1 ' 

-1 

.0. 


.- 2 . 


In part (a), we showed that these vectors are in W . 
However, the sum 


'3' 


■ 1' 


‘ 4 ' 

0 

+ 

-1 

= 

-1 

.0. 


_ — 2 _ 


_ — 2 _ 


is not in W . 

In fact, ifxi =4,X2 = — 1,X3 = — 2 , then 2x]X2 = —8 7^ —2 = X3. 

Thus the sum does not satisfy the defining relation for W , hence not in W . 

Therefore, the subset W is not closed under addition, hence W is not a subspace of R 3 

Solution of Problem 2 






























We prove that W is a subspace of R 3 . To do this, we check the following subspace criteria. 


Subspace Criteria 

(a) The zero vector 0 G R 3 is in W . 

(b) If x, y G W , then x + y G W . 

(c) If x G W and cGR, then cx G W . 


Consider the zero vector 0 = 


G R' 5 . Then x \ 


x i = 3 x 2 and x 3 = 0. Thus the zero vector 0 is in 


= 0 , x 2 = 0 , x 3 = 0 satisfy the defining relations 
W , hence condition (a) is satisfied. 


To check condition (b), let us take x 

sum x + y is in W 
Since x G W , we have 


"xi" 


Ti" 

X 2 

and y = 

72 

_*3_ 


.73. 


from W . Then we want to show that the 


xi = 3x2 andx3 — 0. 


Also, since y G W , we have 


y 1 = 3^2 andy 3 = 0. 


From these, we see that 

+yi = 3x 2 + 3y2 = 3 (x 2 +T 2 ) andx 3 +y 3 = 0. 


Or simply we have 


Xi +yi = 3(x 2 + y 2 ) andx 3 +y 3 = 0. 


Note that this implies that the sum 


x + y = 


xi +y i 

X2 +T2 
.^3 +Ts_ 


satisfies the defining relation, and the sum is in W . Thus condition (b) is satisfied. 


Finally, to prove condition (c), take x = 

product cx G W . 

Since x G W , we have 


xi 

*2 


from W and let cGR. We need to show that the scalar 


L*3J 


xi = 3x2 andx3 = 0. 












Multiplying these by c, we have 


These equalities tells that 
the scalar product 


cx i = 3 (cx 2 ) and cx 3 = 0. 


cx = 


cx 1 
cx 2 
cx 3 


satisfies the defining relation and hence it is in W . Thus condition (c) is met as well. 
Since we have checked all subspace criteria, W is a subspace of IR 3 . 


Let 



" 1 2 2 


"1 2 2" 

A = 

2 3 2 

and B — 

2 3 2 


-1 -3 -4 


5 3 3 


Determine the null spaces of matrices A and B. 


Proof. 

The null space of the matrix A 

We first determine the null space of the matrix A . 

By definition, the null space is 

AT (A) := {xGR 3 I Ax — 0 }, 

that is, the null space of A consists of the solution x of the linear system Ax — 0 . 


To solve the system Ax — 0 , we apply the Gauss-Jordan elimination. We reduce the augmented matrix 
\A 10] by elementary row operations as follows. 


1 

2 

2 

0 ' 

R2-2R1 

R3+R1 

" 1 

2 

2 

1 - 

O 

2 

3 

2 

0 

- > 

0 

-1 

-2 

0 

-1 

CO 

1 

-4 

0 


0 

-1 

-2 

0 


" 1 

0 

-2 

1 

O 

R 2 

" 1 

0 

-2 

0" 

0 

-1 

-2 

0 

- > 

0 

1 

2 

0 

0 

0 

0 

0 


0 

0 

0 

0 




The last matrix is in reduced row echelon form, and the solutions x = 


Xi 

*2 


ofv 4 x 


L*3j 


0 satisfy 


x\ — 2x3 andx 2 = — 2x 3 


and x 3 is a free variable. 
Thus we have 



























2x 3 


' 2 ' 

X = 

-2x 3 

= *3 

-2 


. *3 . 


. 1 _ 


for any x 3 are solutions. 

Therefore the null space of the matrix A is 


r 


' 2 ' 

\ 

x G R 3 

X = x 3 

-2 

for any x 3 G R > 

l 


. 1 . 

j 


= Span 


r 2 

-2 

\L 1 j 


■ 2 ■ 

the subspace of 1 R 3 spanned by the vector —2 

_ 1 

The null space of the matrix B 

The same procedure works for the matrix B . Thus we omit some detail below. 
For the matrix B , the augmented matrix is reduced to 


1 

2 

2 

0 " 



" 1 

0 

0 

0 " 

2 

3 

2 

0 

-)► • ■ 

■ • ->► 

0 

1 

0 

0 

5 

3 

3 

0 



0 

0 

1 

0 


(Check the computation by yourself.) 

This implies that x = 0 is the only solution of the system 5x = 0 . 

Therefore, the null space J\f (B) of the matrix B consists of just the zero vector: 


r 

'o' 

\ 

•a m = \ 

0 

> 

{ 

. 0 . 

> 


Comment. 

The dimension of the null space AT {A) is 1, and the dimension of the null space J\f (B) is 0. 

In other words, the nullity of the matrix A is 1 , and the nullity of the matrix B is 0 . 

(Recall that the nullity of a matrix is just the dimension of the null space of the matrix.) 

<D Fix the row vector b = [ — 1 3 — 1 ], and let R 3 be the vector space of 3 X 1 column vectors. Define 

W = {v e M 3 I bv = 0}. 

Prove that W is a vector subspace of R 3 , and find a basis for W ■ 


Proof. 

























2 

We verify the subspace criteria: the zero vector 0 of R is in W , and W is closed under addition and 
scalar multiplication. 


First, the zero element in R 3 is 0, the 3 X 1 column vector whose entries are all 0. Then clearly bO = 0 , 
and so 0 G W . 


Next, suppose v, w £ W , and c G R. Then bv = bw = 0 , and so 

b(v + w) = bv + bw = 0. 


Thus, v + w G W . 


Because, again, bv = 0 , we have 

b(cv) = cbv = cO = 0. 

Thus cy G W . These three criteria show that W is a vector subspace of R 3 . 

Comment. 

We can generalize the problem with an arbitrary 1x3 row vector b . 

The proof is almost identical. 

(Look at the proof. We didn’t use components of the row vector b = [ — 1 3 — 1 ] .) 

Note that vectors u,vGR is said to be perpendicular if 

rp 

u • v = u v = 0. 

Thus, the result of the problem says that for a fixed vector uGR 3 , the set of vectors v that are 

perpendicular to u is a subspace in R . 

i T 

(Note that we appy the problem to b = u .) 

Let W be the subset of M 3 defined by 


w= < 

X — 

~ Xi 

X2 

e M 3 

5xi — 2x2 +^3=0 > 


< 

_^3_ 


> 


Exhibit a 1 X 3 matrix A such that W = Af(A), the null space of A. Conclude that the subset W is a subspace 
of M 3 . 


Solution. 

Note that the defining equation 5xi— 2x2 +V3 = 0 can be written as 








[ 5 -2 11 


*i 
X 2 
,*3, 


- 0. 


Hence if we put A — [ 5 —2 1 ] , then the defining equation becomes 

Ax = 0. 


Hence, we have 

W = {x e R 3 \Ax = 0 }, 

which is exactly the null space of the 1 X 3 matrix A . 

Hence we have proved that W = Af(A) . 

In general, the null space of an m X n matrix is a subspace of the vector space R” . 

(See the post The null space (the kernel) of a matrix is a subspace of WL n .) 

Since we showed that W is the null space of 1 X 3 matrix A , we conclude that W is a subspace of R . 

Let V be a subset of the vector space R n consisting only of the zero vector of M n . Namely V = (0). Then 
prove that V is a subspace of M n . 


Proof. 

To prove that V — {0} is a subspace of J$L n , we check the following subspace criteria. 


Subspace Criteria 

(a) The zero vector 0 £ WL n is in V. 

(b) If x, y £ V , then x + y £ V . 

(c) If x £ V and c £ R, then cx £ V . 


Condition (a) is clear since V consists of the zero vector 0. 

To check condition (b), note that the only element in V = {0} is 0. Thus if x, y £ V , then both x, y 
are 0. Hence 

x + y = 0 + 0 = 0£ V 


and condition (b) is met. 

To confirm condition (c), let x £ V and c £ R. Then x = 0 . 
We have 


cx = cO = 0 £ V 


and condition (c) is satisfied. 

Hence we have checked all the subspace criteria, and hence the subset V — {0} consisting only of the 
zero vector is a subspace of WL n . 





What’s the dimension of the zero vector space? 

What’s the dimension of the subspace V = {0} ? 

The dimension of a subspace is the number of vectors in a basis. So let us first find a basis of V. 

Note that a basis of V consists of vectors in V that are linearly independent spanning set. Since 0 is the 
only vector in V, the set S — {0} is the only possible set for a basis. 

However, S is not a linearly independent set since, for example, we have a nontrivial linear combination 

1-0 = 0 . 

Therefore, the subspace V — {0} does not have a basis. 

Hence the dimension of V is zero. 

^ Prove that the null space AT (A) is a subspace of the vector space M n . 


Proof. 

Subspace criteria 

We check the following criteria for Af(A) to be a subspace of M” . 


(a) The zero vector 0^ E K” is in Af {A) . 

(b) If x, y £ Af(A) , then x + yE AT (A) . 

(c) If x £ Af(A) and cEM, then cx £ Af(A) . 


Check (a) 

Since we havev40^ = 0 m , the zero vector 0^ £ R” is in Af(A) . Thus (a) is met. 

Check (b) 

For (b), take arbitrary vectors x, y £ Af(A) . 

By the definition of AT {A) , we have 

Ax. — 0 m and Ay = 0 m . (* 

We want to show that x + y £ Af (A) . Thus we need to showv4(x + y) = 0 . 

This equality can be proved as follows. 

We have 

A(x + y) = Ax + Ay 

= 0 + 0 = 0 . 


Here the second equality follows from (*). 
Thus x + y £ Af {A) and (b) is satisfied. 



Check (c) 

To check (c), take arbitrary x E A/"(v4) and a scalar cGK. 
Since x E J\f (A) , we have 

Ax = 0 m . 

Multiplying by the scalar c , we have 


cAx = cO /77 . 


This is equivalent to 

v4(cx) = 0 m . 

This equality implies that cx E A/* (A) , hence (c) is met. 

Thus we checked the conditions (a), (b), and (c) and conclude that the null space Af(A) is a subspace of 

R\ 


G Prove that every plane in the 3-dimensional space R 3 that passes through the origin is a subspace 70)3 

Proof. 


Each plane P in M 3 through the origin is given by the equation 


ax + by + cz = 0 


for some real numbers a, c. 

That is, the plane P is a set of vectors 


P = 


satisfying the equation ax + by + cz — 0: 


r 

X 


\ 

j 

y 

e R 3 

ax + by + cz — 0 > 

l 

_z_ 


j 


Now the equation can be written as the matrix equation 

Ax — 0, 

where A is the 1 X 3 matrix A, x E R 3 , and 0 is the 1 -dimensional zero vector given by 


A—\a b cl ,x = 


, 0 =[ 0 ], 


Thus, the plane can be written as 

P — {x G M 3 I Ax = 0} 










and this is the definition of the nullspace Af(A) of A. That is 

P = J\f(A). 


Therefore, the plane P is the nullspace of the 1 X 3 matrix A. 

Since the nullspace of a matrix is always a subspace, we conclude that the plane P is a subspace of R 
Therefore, every plane in R 3 through the origin is a subspace of R 3 . 


0 


Let S be the subset of M 4 consisting of vectors 

set S' is a subspace of M 4 . 

(The Ohio State University) 


satisfying 2x + 3y + 5z + 7w = 0. Then prove that the 


Proof. 

First, in set theoretical notation, the definition of S can be written as 


S = 


/ 

X 


\ 

< 

y 

z 

G E 4 

2x + 3y + 5z + 7w = 0 

< 

_w _ 


J 


Let A = [ 2 3 5 7 ] be the 1 X 4 matrix. Then the defining equation 

2x + 3y + 5z + 7w — 0 can be written as 


Ax. = 0, 

where 


x 


x = 


y 

z 

w 


It follows that the set S is the null space of A, that is, s = A f(A). 

Since every null space is a subspace, we see that S is also a subspace of R 4 


0 

(a) Let S be the subset of M 4 consisting of vectors 


x 

y 

z 


satisfying 


w 


2x + Ay + 3z + 7w + 1 = 0. 















Determine whether S is a subspace of R 4 . If so prove it. If not, explain why it is not a subspace. 

x 


(b) Let S be the subset of M 4 consisting of vectors 


satisfying 


z 

_w _ 

2x + % + 3z + 7w = 0. 

Determine whether S is a subspace of M 4 . If so prove it. If not, explain why it is not a subspace. 
(The Ohio State University) 


Solution. 

(a) 2 x 4 y -\- 3 z 7w 1 = 0 

We claim that S is not a subspace of R 4 . 

If S is a subspace of R 4 , then the zero vector 0 = 


in R 4 must lie in S . 


However, the zero vector 0 does not satisfy the equation 

2x + 4y + 3z + 7w + 1 = 0. 
So 0 ^ S , and we conclude that S is not subspace of R 4 . 

(b) 2x + 4 y + 3 z + 7w = 0 

In a set theoretical notation, we have 



/ 

X 


\ 



y 

z 

G R 4 

2x T - 4 y T - 3 z 1 w = 0 > 

1 

\ 

_ w _ 


> 


Let A be the 1x4 matrix defined by 

A — [2 4 3 7]. 
Then the equation 2x + Ay + 3z + 7w = 0 can be written as 


A 


= 0 . 


So we have 


t 

X 



X 

\ 

i 

y 

z 

G R 4 

A 

y 

z 

= 0 > 

l 

_w_ 



_ w _ 

> 


= M(A), 




















the null space of A . 

Recall that the null space of a matrix is always a subspace. 

Hence the subset S is a subspace of R 4 as it is the null space of the matrix A . 

Let R 2 be the x-y- plane. Then R 2 is a vector space. A line <Ct 2 with slope m and 7/-intercept b is defined by 

£ — {(cc, y) E R 2 | y — mx + b}. 

Prove that i is a subspace of R 2 if and only if b — 0. 

Proof. 

We must prove two statements. First we show that if b ^ 0, then l is not a subspace. Then we show 
that if b = 0, then £ is a subspace. 

In order for £ to be a subspace, it must contain the zero element (0, 0). Plugging this point into the 
defining equation yields b = 0. Thus if b ^ 0, then £ cannot be a subspace. 


Now we prove that if b = 0, then £ is a subspace. We have already shown that (0, 0) E £. Now 
suppose we have two points (x 1? >q), (.* 2 ^ 2 ) e Then we have 

Jl +y 2 = + = m ( x \ + *2) 

and so (xj + x 2 ,> ; 1 +T 2 ) contained in £. 


Finally for c E R we must show that c(x^y^) = (cx^cy^) lies in £. We check 

cyi = c(mx |) = m(cxj), 

and so (cx 1? cyj) E f. This proves that if b = 0, then £ is a subspace. 




Let S' be the following subset of the 3-dimensional vector space R 3 . 


s = < 

( 

x e M 3 

X — 

1 

to 1 — 1 

\ 

,*1,*2,0:3 G Z > 


< 



> 


where Z is the set of all integers. Determine whether S is a subspace of R 3 . 


Proof. 

We claim that S is not a subspace of R 3 . 

If S is a subspace, then S is closed under scalar multiplication. 
But this is not the case for S . 







For example, consider x 


1 

1 

1 


Since all entries are integers, this is an element of S. 

Let us compute the scalar multiplication of this vector x and the scalar 1/2 £ R. We have 


1 

2 


• x = 


1 

2 


' 1 " 


' 1 / 2 ' 

1 

= 

1/2 

. 1 . 


. 1 / 2 . 


Since 1/2 is not an integer, the scalar multiplication is not in S. Therefore the subset S cannot 
be a subspace of i? 3 . 


Let a and b be fixed vectors in M 3 , and let W be the subset of M 3 defined by 

W = {x £ R 3 | a T x — 0 and b T x — 0}. 
Prove that the subset W is a subspace of M 3 . 

Proof. 

We prove the following criteria for the subset W to be a subspace of R . 

(a) The zero vector 0 £ R 3 is in W . 

(b) If x, y £ W, then x + y £ W. 

(c) If x £ W and c £ R, then cx £ W. 


For (a), note that a T 0 = 0 and b T 0 = 0. Thus the zero vector 0 £ R 3 is in W. 


To check (b), let x, y £ W. Then we have the following relations. 

a T x = 0 and b T x = 0, and a T y = 0 and b T y = 0. (*) 

To show that x + y £ W, we need to show that 

a T (x + y) = 0 and b T (x + y) = 0. 

We first compute 

a (x + y)=a x + a y 
= 0 + 0=0 


by the relations (*). 
Similarly, we have 









b T (x + y) = b T x + b T y 
= 0 + 0=0 


by the relations (*). 

Thus the vector x + y satisfies the defining relations for W, hence x + y G W. 


Finally, to prove (c), let x G W and let cGR. 
Since x G W, we have 


a T x = 0 and b T x = 0. 

Multiplying by the scalar c from the left, we obtain 

a T (cx) = 0 and b T (cx) = 0. 

(Note that since c is a scalar, we can switch the order of the product of c and a . Same for c and 


b T .) 


These equalities proves that the vector cx satisfies the defining relation for W . Thus cx G W. 


Therefore the criteria (a)-(c) are all met, and we conclude that W is a subspace of IKl . 




Let U and V be subspaces of the n-dimensional vector space M n . Prove that the intersection U Pi V is also a 
subspace of R n . 


Definition (Intersection). 


Recall that the intersection U D V is the set of elements that are both elements of U and V . 
In the set theoretical notation, we have 

U HV — {x | x G £/ and x G V}. 


Proof. 

To prove that the intersection U Pi V is a subspace of WL n , we check the following subspace criteria: 


The zero vector 0 of WL n is in U Pi V. 


For all x, y G U fl V , the sum x + yGfflf. 


^ For all x G U D V and r G R, we have rx G U Pi V . 



As U and V are subspaces of WL n , the zero vector 0 is in both U and V . 
Hence the zero vector 0 G R” lies in the intersection U D V. 

So condition 1 is met. 


Suppose that x, y £ U D V . 

This implies that x is a vector in U as well as a vector in V . 

Similarly, y is a vector in U as well as a vector in V . 

Since U is a subspace and x and y are both vectors in U, their sum x + y is in U . 
Similarly, since V is a subspace and x and y are both vectors in V, their sum x + y G V . 

Therefore the sum x + y is a vector in both U and V . 

Hence x + y G U fl V . 

Thus condition 2 is met. 


To verify condition 3, let x G= U Pi V and rGM. 

As x G U D V , the vector x lies in both U and V . 

Since both U and V are subspaces, the scalar multiplication is closed in U and V , respectively. 

Thus rx G U and rx G V . 

It follows that rx G U D V . 

This proves condition 3, and hence the intersection U H V is a subspace of WL n . 

$ Determine whether the following is true or false. If it is true, then give a proof. If it is false, then give a 

counterexample. Let W\ and W2 be subspaces of the vector space M n .If B\ and B2 are bases for W\ and W2, 
respectively, then B\ Pi B2 is a basis of the subspace W\ D W2. 


Solution. 

The statement is false. We give a counterexample. 

Let us consider the vector space R 2 , the plane. 

Let W 1 and W 2 be R 2 itself. Then they are subspaces of R 2 . 

Let 

HE’ 

be bases of W\ and W2 , respectively. 

(Note. B 1 consists of two linearly independent vectors in the 2-dimensional vector space W \ — R 2 , 
hence B 1 is a basis ofW\. Similarly, B 2 is a basis of W 2 .) 


1 } and52 ={ L 1 .’ -1} 












Since W\ and W 2 are both R 2 , we have W\ D W 2 — R 2 . 

However, the intersection B\ Pi B 2 is the empty set, and the empty set is not a basis of 
W 1 P W 2 — R 2 • Thus, we have found a counterexample to the statement. 

0 Let U and V be subspaces of the vector space M n . If neither U nor V is a subset of the other, then prove that the 
union U U V is not a subspace of M n . 


Proof. 

Since U is not contained in V, there exists a vector u G U but u ^ V . 

Similarly, since V is not contained in U, there exists a vector v G V but v ^ U . 


Seeking a contradiction, let us assume that the union is U U V is a subspace of WL n . 
The vectors u, v lie in the vector space U U V. 

Thus their sum u + v is also in U U V. 

This implies that we have either 


u + vG f/oru + vG V. 

If u + v G U , then there exists u / G U such that 

U + V = uf 

Since the vectors u and lP are both in the subspace U, their difference u f — u is also in U. Hence we 
have 


v = u' - u G U. 

However, this contradicts the choice of the vector v ^ U. 


Thus, we must have u + v G V . 

In this case, there exists v ; G V such that 

U + V = vf 


Since both v, v' are vectors of V, it follows that 

u = v' - v G Vj 


which contradicts the choice of u ^ V. 


Therefore, we have reached a contradiction. Thus, the union U U V cannot be a subspace of R” . 
Related Question. 

In fact, the converse of this problem is true. 



Problem. Let W \, W 2 be subspaces of a vector space V. Then prove that W\ U W2 is a subspace 
of V if and only ifJV\ C W2 or W2 C W 


For a proof, see the post “Union of Subspaces is a Subspace if and only if One is Included in Another 44 . 


€> Let XV 1 , XV2 be subspaces of a vector space V. Then prove that XVi U XV2 is a subspace of V if and only if 
Wi c XV2 or XV 2 C Wi- 


Proof. 

If W1 U W 2 is a subspace, then W\ C W 2 or W 2 C W\. 

( =>* ) Suppose that the union W\ U W2 is a subspace of V . 

Seeking a contradiction, assume that W\ (jL W2 and W2 W\. 

This means that there are elements 

x £ Wi \ W 2 and y G W 2 \ W x . 


Since W 1 U W2 is a subspace, it is closed under addition. Thus, we have x + y £ Wi U W2 - 
It follows that we have either 


x+jG Wi or x+y e W 2 . 
Suppose that x + y G W 1 . Then we write 


y = h+j) -x. 


Since both x + y and x are elements of the subspace W \, their difference y — (x + y) — x is also in 
W 1. However, this contradicts the choice ofy G W2 \ JVi . 


Similarly, when x + y G W2 , then we have 

x = (x+y) -y G W 2, 

and this contradicts the choice ofxG J¥ 1 \ W 2 - 

In either case, we have reached a contradiction. 

Therefore, we have either W\ C W2 or W2 C W 1. 

If fFl C W2 or W2 C then JFi U ^2 is a subspace. 

( ) If we have W\ C 1 ^ 2 , then it yields that W\U W2 = W2 and it is a subspace of U. 

Similarly, if W2 C IFi, then we have W\ U W2 — W2 and it is a subspace of U. 

In either case, the union W\ U W2 is a subspace. 



Let A and B be n X n matrices. Then prove that 

CN{A + B). 


De fi nition. 

Recall that the null space (or kernel) of an n X n matrix is 

A f(A) = {x e P I Ax = 0 }. 

The null space (A/") is a subspace of the n -dimensional vector space M” . 
Proof. 

Let x be an arbitrary vector in the intersection Af(A) DA f (B) . 

Then the vector x belongs to both AT {A) and Af (B) . 

Thus, by definition of the null space, we have 

Ax = 0 and Bx — 0. 

If follows from these equalities that we have 

(A + B)x = Ax + Bx = 0 + 0 = 0. 


Hence x lies in the null space of the matrix A + B , that is, x £ Af {A + B) . 
Since x is an arbitrary element of Af(A) nAf(B) , we have shown the inclusion 

AT (A) n Af(B) c Af(A+B), 


as required. 



Bases and Dimension of Subspaces in R n 


Definition 


Let V be a subspace in M n . 

A basis for V is a linearly independent spanning set for V . 

The number of vectors in a basis for V is called the dimension of V. 

The dimension of the range TZ(A) of a matrix A is called the rank of A. 

The dimension of the null space Af(A) of a matrix A is called the nullity of A. 


Summary 


A basis is not unique. 

The rank-nullity theorem: 

(Rank of ^ 4 )+(Nullity of 2 l)=(The number of columns in A). 
The rank of A and the rank of A T are the same. 


Problems 

QLet 


a 


" 1 " 


" 2 " 

b 

j v i = 

2 

j v 2 = 

-1 

_ c _ 


_ 0 _ 


_ 2 _ 


Find the necessary and sufficient condition so that the vector v is a linear combination of the vectors Vi , V2 . 


Solution 1. (Use the range) 


The question is equivalent to finding the condition so that the vector v is in the range of the matrix 


^ = hp v 2 ] 


1 2 

2 -1 

0 2 


The vector v is in the range 7 Z(A) if and only if the system Ax = v is consistent. 

We reduce the augmented matrix of the system by elementary row operations as follows. 
























■ 1 

2 

a " 

* 2 - 2 *i 

"1 

2 

a " 

2 

-1 

b 

-► 

0 

-5 

b - 2a 

0 

2 

c 


0 

2 

c 



* 2 

+ 3*3 1 

2 


a " 




0 

1 

b- 

2 a + 3 c 


0 2 


* 3 - 2*2 


2 a 

1 b - 2a + 3c 
0 4a - 2b - 5c 


The last matrix is in echelon form and the system is consistent if and only if 4a - 2b - 5c = 0. 
Therefore, the condition that v be a linear combination of v 1? v 2 is 4a - 2b - 5 c = 0. 

Solution 2. (Use the cross product) 

Note that the vectors v 1? v 2 spans a plane P in R 3 . 

Thus, the vector v is a linear combination of v 1? v 2 if and only if v lies on the plane P. 

The cross product 


2 -1 

0 2 



'I ' 


1 2 1 

Vi X v 2 = 

2 

0 

X 

-1 

2 


1 2 
0 2 


' 4 

-2 

-5 


1 2 
2 -1 


is perpendicular to the plane P. 

Therefore, the vector v is on the plane if and only if the dot (inner) product 

v ■ ( v i x v 2 ) = 0. 


Namely, 



= 4a - 2b - 5c = 0, 


and we obtained the same condition as in Solution 1. 


A 

Find a basis of the null space of the matrix B 


1 1 2 
-2 -2 -4 






























Solution of (a) 

Let us first recall the definitions of the null space and range. The null space of the 2 x 3 matrix A is 
defined by 

MA) = {x € R 3 Ml = 0} 

and the range of A is defined by 

7 Z(A) = {y e R 2 |dx = y for some x 6 R 3 }. 

Note that the null space is a subspace of R 3 and the range is a subspace of R 2 . 


Consider first the vector a 


Also, since we have 


r-3 


1 

1 


Since the vector a is three-dimensional, it cannot be in the range. 


A a 


'I 2 1 " 

3 6 4 



*0, 


the vector a is not in the null space of A. 


Next, we consider the vector b 



. Again, this is three-dimensional vector, hence b is not in the 


range 7 1(A). 
Since we have 


A b 


‘1 

3 


2 1 1 
6 4 


-2 

1 

0 



= 0 , 


we see that the vector b is in the null space A f(A). 


Finally, we consider the vector c 


■1 ' 

1 . Since it is two-dimensional, the vector c is not in the null 


space A /(A). 

Let us determine whether c is in the range 7 Z(A). To do this, we check whether the system Ax = c is 
consistent or not. The augmented matrix of the system is 


\4 I c] = 


■ l 

3 


2 

6 


1 

4 


1 

1 


Applying the elementary row operations, we have 






















■1 

2 

1 

1 " 

R 2 -3Ri 

" 1 

2 

1 

1 ' 

R 1 R 2 

■1 

2 

0 

3 " 

3 

6 

4 

1 


0 

0 

1 

-2 


0 

0 

1 

-2 


and thus it has the general solution 


Xj = - 2x 2 + 3 
X 3 = - 2. 


Therefore, the vector c is in the range 7Z(A). 


(For example, if x Q = 


1 

1 

-2 


, then Ax 0 = c.) 


In summary, the vector a is neither in Af(A) or in 7 Z(A). The vector b is in Af{A) but not in 7 Z(A). The 
vector c is not in Af(A) but in 7 Z(A). 


Solution of (b) 

We first describe the null space Af{B) of the matrix B. 


The vector x = 


is in the null space Af(B) if and only if Bx = 0. 


The augmented matrix of Bx = 0 is 


r 1 1 2 01 

-2 -2 -4 0 


Applying the elementary row operations, we have 


" 1 

1 

2 

0" 

R 2 +2R r 

"I 

1 

2 

0" 

-2 

-2 

-4 

0 


0 

0 

0 

0 


and the general solution is 


Xj = - x 2 ~ 2x 3 . 


Therefore, the vector x = 


is in the null space Af(B) if and only if Xj = 


- x 2 ~ 2x3. 


Thus, 




















MB) = {x = 


x l 

x 2 

x 3 


E R 3 1 x 1 = - x 2 ~ 2 x 3 } 


= {x 6 R 3 I X = 


~x 2 - 2 x 3 
x 2 
x 3 



"-1 ' 


'-2 " 

{x € R 3 | x = X 2 

1 

0 

+ X 3 

0 

1 



"-1 1 


"-2 1 


Span/ 

1 

? 

0 


( 

0 


1 



Thus, the set 



is a spanning set of the null space M(B). 

Let us verify that S is a linearly independent set. Suppose that we have 



"-1 1 


'-2 " 

c i 

1 

+ C 2 

0 


0 


1 


for some scalars c 1? c 2 . Then we have 


C 1 2c 2 


'O' 

C 1 

= 

0 

■ 

<N 


0 


and thus c i = c 2 = 0 and S is linearly independent. 

Hence S is a linearly independent spanning set, and thus S is a basis of the null space A f(B). 


O 

(a) Let A 


13 0 0 

13 12 

13 12 


. Find a basis for the range H(A) of A that consists of columns of A. 


(b) Find the rank and nullity of the matrix A in part (a). 


Solution. 




























(a) Find a basis for the range 7 Z(A) 

Note that the range 7 Z(A) is the column space of A. Thus, 

11(A) = Span {A l9 A 2 ,A 3 ,A 4 }, 

where A - is the i- th column vector of A. 

We use the “leading 1” method. 

We reduce the matrix A by elementary row operations as follows. 
We have 


r 3 r 2 

■1 

3 

0 

o- 

r 2 r x 

■1 

3 

0 

o- 

A -► 

1 

3 

1 

2 


0 

0 

1 

2 


0 

0 

0 

0 


0 

0 

0 

0 


The first and the third column contain the leading 1 ’s. 

Thus, the set {A^A^} is a basis of the range 1Z(A), which consists of the first and the third column 
vectors of A. 

(b) Find the rank and nullity of the matrix A 

By part (a), we know that {A^A^} is a basis of the range of A. 

Hence the dimension of the range is 2. 

Thus the rank of A , which is the dimension of the range 1Z(A), is 2 
Recall the rank-nullity theorem. Since A is a 3 x 4 matrix, we have 

rank of A + nullity of A = 4. 

Since we know that the rank of A is 2, it follows from the rank-nullity theorem that the nullity of A is 

2 . 


IL Let 

, I" 1 1 01 

A 

U i oj 

be a matrix. Find a basis of the null space of the matrix A. 

Solution. 

The null space Af(A) of the matrix A is by definition 

M.A) = {x € R 3 Mx = 0}. 

In other words, the null space consists of all solutions x of the matrix equation Ax = 0. 

So we first determine the solutions of Ax = 0 by Gauss-Jordan elimination. The augmented matrix is 


" 1100 " 
110 0 










Subtracting R l from R 2 , we reduce the augmented matrix to the reduced row echelon form matrix as 
follows. 


■ 1 

1 0 

0 " 

R 2 R\ 

■1 

1 

0 

0 " 

1 

1 0 

0 

> 

0 

0 

0 

0 


Thus the solution x = 


of Ax = 0 satisfies x 1 + x 2 = 0, or equivalently x l = - x 2 . 


Substituting the last equality, we see that solutions are of the form 



"-X 2 ' 


'-1 1 


' O ' 

X = 

■ 

* * 
bo 

■_ 

= X 2 

1 

0 

+ X 3 

0 

1 


Therefore the null space is 




"-1 " 


■o - 

J x £ R 3 

X = X 2 

1 

+ X 3 

0 



0 


1 


for any x 2 , E R 


= Sp< 


1 



"-1 1 


'O’ 


Thus, the set < 

1 

? 

0 


( 

0 


1 



is a spanning set for the null space Af(A). 



'-1 1 


'O' 

Now, we check that the vectors 

1 

? 

0 


0 


1 


are linearly independent. 


Consider a linear combination 



■-1 ■ 


■O' 

«1 

1 

+ a 2 

0 


0 


1 


This is equivalent to 



































Hence we must have a l = a 2 = 0. 


Since the linear combination equation has only the zero solution, the vectors 
independent. 


■-1 ■ 


■ 0 - 

1 

? 

0 

0 


1 


are linearly 


Therefore the set 


space 



■-1 ■ 


'O' 



1 

? 

0 



0 


1 



is a linearly independent spanning set, thus it is a basis for the null 


^ Let 


A = 


(a) Find a matrix B in reduced row echelon form such that B is row equivalent to the matrix A. 

(b) Find a basis for the null space of A. 

(c) Find a basis for the range of A that consists of columns of A. For each columns, Aj of A that does not appear 
in the basis, express Aj as a linear combination of the basis vectors. 

(d) Exhibit a basis for the row space of A. 



In part (c), you may use the following theorem. 


Theorem (leading-1 method): If B is a matrix in reduced row echelon form that is row 
equivalent to A , then all the column vectors of A whose corresponding columns in B have 
leading 1 ’s form a basis of the range of A. 


In part (d), you may use the following theorem. 


Theorem (Row-space method): If B is a matrix in (reduced) row echelon form that is row 
equivalent to A , then the nonzero rows of B form a basis for the row space of A. 


Solution. 

(a) Find a matrix B in reduced roe echelon form such that B is row equivalent to the matrix A. 
















We apply the elementary row operations to the matrix A and obtain 



"1 

1 

2 ' 

R2 — 2 Ri 

Rs — 2 R\ 

'1 

1 

2' 

A = 

2 

2 

4 

- > 

0 

0 

0 


2 

3 

5 . 


0 

1 

1. 


i?2 ^Rs 




'1 

1 

to 

_ 1 

fi 3 

to 

'1 

0 

1" 

0 

1 

1 

-» 

0 

1 

1 

.0 

0 

0 . 


.0 

0 

0. 


The last matrix is in reduced row echelon form that is row equivalent to A. 
Thus we set 


B = 


1 0 1 
0 1 1 
0 0 0 


(b) Find a basis for the null space of A. 


The null space J\f \A) of the matrix is the set of solutions of the homogeneous system Ax. — 0 . 
By part (a), the augmented matrix [A | 0 ] is row equivalent to [B | 0 ]. 
xi 

must satisfy 


Thus, the solution x = 


X2 

L^3 


x\ — —xz and x 2 — 


where £3 is a free variable. 
Thus the solutions are given by 



"-Z3" 



X = 

-X3 

= x 3 

-1 


_ x 3 _ 


. 1 _ 


for any number £ 3 . 
Hence we have 


J\f(A) = {x G M 3 | x = £3 


-1 

-1 

1 


for any x% E 


= Span 


-1 

-1 

1 


From this, we deduce that the set 


is a basis for the null space of A. 

(c) Find a basis for the range of A that consists of columns of A. (Longer version) 



























Let us write 


A = 


1 1 
2 2 
2 3 


2 

4 

5 


[^4i, A 2 ,^4 3 ], 


where A\ , A 2 , A3 are the column vectors of the matrix A. 

The range 1 Z (A) of A is the same as the column space of A. 

Thus, it suffices to find the maximal number of linearly independent vectors among the column 
vectors A \, A2 , A3. 

Consider the linear combination 


X1A1 + x 2 A 2 + x s A 3 = 0 . (*) 

Then this is equivalent to the homogeneous system 


Ax = 0 


and we already found the solutions in part (b): 

x\ = —xs and X2 = -£3. 

This tells us the vectors A\ , A2 , A3 are linearly dependent. 

For example, X\ — —1, £2 — — 1, #3 = 1 is a nonzero solution of the system (*). 
Thus, we have 


7 1 (A) = Span(Ai, A 2 , A 3 ) = Span(Ai, A 2 ). 

On the other hand, if we consider only Ai and A 2 , they are linearly independent. 

(To see this, you just need to repeat the above argument without A3. This amounts to just ignoring the 
third columns in the computations.) 

Therefore, the set {Ai, A 2 } is a linearly independent spanning set for the range. 

Hence a basis of the range consisting column vectors of A is 

{Ai, A 2 }. 

The only column vector which is not a basis vector is A3. 

We already found that X\ = —1, £2 = — 1, ^3 = 1 is a nonzero solution of (*). 

Thus we have 


—A\ — A2 + A3 — 0. 

Solving this for A3, we obtain the linear combination for A3 of the basis vectors: 

A3 = A\ + A2. 

(c) A shorter solution using the leading 1 method 

Here is a shorter solution which uses the following theorem. 

Theorem (leading -1 method): If B is a matrix in reduced row echelon form that is row equivalent to A 
, then all the column vectors of A whose corresponding columns in B have leading 1 ’s form a basis 





of the range of A. 

Looking at the matrix B, we see that the first and the second columns of B have leading 1 ’s. Thus the 
first and the second column vectors of A form a basis for the range of A. 

(d) Exhibit a basis for the row space of A. 

We use the following theorem. 

Theorem (Row-space method): If B is a matrix in (reduced) row echelon form that is row equivalent 
to A, then the nonzero rows of B form a basis for the row space of A. 

We have already found such B in part (a), and the first and the second row vectors are nonzero. Thus 
they form a basis for the row space of A. 

Hence a basis for the row space of A is 



Comment. 

The longer solution of part (c) is essentially the proof of Theorem (leading 1 method). 

It is good to know where the theorem came from, but when you solve a problem you may forget and 
just use the theorem like the shorter solution of (c). 


Let 


A = 


1 

0 

1 

0 

0 


-10 0 

111 
-10 0 

2 2 2 

0 0 0 


(a) Find a basis for the null space AT (A). 

(b) Find a basis of the range 'JZ(A). 

(c) Find a basis of the row space for A. 
(The Ohio State University) 


□ 


Q Determine whether each of the following sets is a basis 


(a) S = < 


1 

0 

-1 


2 

1 

-1 


-2 

1 

4 



f 

T 


"2" 


'3" 

\ 


(b) S = < 


4 

5 

5 

5 

6 





7 


8 


9 




f 

V 


"O' 

\ 




II 

Co 

'o' 


1 

5 

1 


> 






2 


7 






f 

1 


7 


3 


-1 

(d) S = < 


2 

5 

4 

5 

8 

5 

9 


< 

5 


0 


6 


10 

































Definition (A Basis of a Subspace). 

A subset S of a vector space V is called a basis if 


S is linearly independent, and 


S is a spanning set. 


Solution. 

Recall that any three linearly independent vectors form a basis of R 3 . 

(See the post “Three Linearly Independent Vectors in R 3 Form a Basis. Three Vectors Spanning R 3 
Form a Basis ” for the proof of this fact.) 

I 



Let us check that whether S is a linearly independent set. 
Consider the linear combination 



■ 1 ■ 


' 2 1 


■-2 1 

*1 

0 

-1 

+ x 2 

1 

-1 

+ x 3 

1 

4 


This is equivalent to the matrix equation 


' 1 2-2" 


■_ 

0 1 1 


x 2 

i 

T 

T 


*3 


■ ■ 


To find the solution, consider the augmented matrix. 
Applying elementary row operations, we obtain 




1 

2 

-2 

O' 

*3+*l 

■1 

2 

-2 

0 



0 

1 

1 

0 

-► 

0 

1 

1 

0 


— 

1 

-1 

4 

0 


0 

1 

2 

0 

*1-2*2 

-1 

0 

-4 

O' 

*1+4*3 

-1 

0 

0 

O' 

-► 


0 

1 

1 

0 


0 

1 

0 

0 

*3 * 2 

0 

0 

1 

0 

* 2 -* 3 

0 

0 

1 

0 


It follows that the solution is Xj = x 2 = v 3 = 0. 





























Hence S is linearly independent. 

As S consists of three linearly independent vectors in R 3 , it must be a basis of R 3 . 



As in part (a), we determine whether the set S is linearly independent or not by considering the 
following augmented matrix: 


1 1 

2 

3 

o- 

R 2~ 4R \ 

1 1 

2 

3 

o- 

4 

5 

6 

0 


0 

-3 

-6 

0 

7 

8 

9 

0 

* 3 - 7 *! 

0 

-6 

-12 

0 


1 

3 R 2 

"1 

2 

3 

o- 

*1-2*2 

'I 

0 

-1 

0 - 

— > 

0 

1 

2 

0 


0 

1 

2 

0 


0 

-6 

-12 

0 

*3 + 6*2 

0 

0 

0 

0 


Thus, the general solution is = x 3 , x 2 = ~ 2x 3 , where x 3 is a free variable. 
Hence, in particular, there is a nonzero solution. 

So S is linearly dependent, and hence S cannot be a basis for R . 



A quick solution is to note that any basis of R 3 must consist of three vectors. Thus S cannot be a basis 
as S contains only two vectors. 


Another solution is to describe the span Span(S). 


Note that a vector v = 


is in Span(S) if and only if v is a linear combination of vectors in S. 
Equivalently, the vector v is in Span(S) if and only if the system 


-1 

0- 




'a " 




'*i' 



1 

1 



= 

b 

2 

7 


x 2 


c 


is consistent. 

Let us consider the augmented matrix and reduce it by elementary row operations. 





























1 1 

0 

a ' 

r 2 

"1 

0 

a " 

1 

1 

b 


0 

1 

b - a 

2 

7 

c 

R 3 -2R, 

0 

7 

c - 2a 


r 1 

o 

o 


0 a' 

1 b - a 

0 5a~ lb + c 


Note that we obtained the (3, 3)-entry by c - 2a - l(b - a) = 5a - lb + c. 
It follows that the system is consistent if and only if 

5a - lb + c = 0. 


Thus, for example, the vector 


is not in Span(S) as5-l-7-0 + 0^0. 


Hence Span(£) is not R , and we conclude that S is not a basis. 


(d) S = 


l 

■1 ■ 


.7. 


■3- 


1 

2 

? 

4 

? 

8 

? 


5 


0 


6 



-1 

9 

10 


The set S contains four 3-dimensional vectors. Hence S is linearly dependent, and thus S is not a basis. 


Suppose that the vectors 


1 


1 

1 


0 

0 

j v 2 = 

-3 

0 


1 

to 

1 

0 


—1 

T—1 

_1 


are a basis vectors for the null space of a 4 X 5 matrix A. Find a vector x such that x ^ 0, x ^ Vi, x ^ V 2 and 
Ax = 0. 

(Stanford University) 


Solution. 


We are asked to find a vector x in the null space of A, which are not 0, Vi, V 2 . 


Recall that the null space is a vector space. Thus, any linear combination of vectors in the null space is 
still in the null space. 

Since Vi, V 2 are basis of the null space of A, they are in particular vectors in the null space of A. 
Thus, for example, 























is an element of the null space and it is not equal to 0 , Vi , V 2 . 


Another example is 


x 


-6 


vi + v 2 = 


1 

-3 

-2 

1 


In general, you can prove that any vector of the form 


x = C1V1 + c 2 v 2 , 

where C\ , C 2 are scalars such that (ci , 02 ) 7 ^ (0, 0), (1, 0), (0,1), satisfied the required conditions. 


^ Let V be the following subspace of the 4-dimensional vector space ] 



s 

Xl 



V < 


X3 

G M 4 

Xl — X2 + x 3 — X4 = 0 > 


K 

_ ^4_ 


> 


Find a basis of the subspace V and its dimension. 


Solution. 


x 2 


x 3 



Any vector x = 


in V satisfies - x 2 + - x 4 = 0, or equivalently x l = x 2 ~x 3 + x 4 . 














■ 

>r 


■jc 2 - X 3 + x 4 1 

X 2 


X 2 

x 3 


x 3 

m _ 


m 

X 

4^ 



1 1 1 


■-1 1 


■1 1 


1 


0 


0 

x 2 

0 

+ X 3 

1 

+ X 4 

0 


0 

1 ■ 


0 

■ ■ 


1 

1 1 


Let 



■1 ■ 


■-1 ■ 


■1 ■ 


1 


0 


0 

Ul = 

0 

> u 2 = 

1 

u 3 = 

0 


0 

■ ■ 


0 

■ ■ 


1 

■ ■ 


The above computation shows that any vector x in V can be written as a linear combination of the 
vectors u 1? u 2 , u 3 . 

Hence the set {u 1? u 2 , u 3 } is a spanning set for the subspace V. 


We claim that {u 1? u 2 , u 3 } is a linearly independent set. 
Consider 


“1“ ^ 2 U 2 “1“ ^3^3 0. 

We show that this equation has only the zero solution a x = a 2 = a 2 = 0. 
The equation (*) can be written as 


1 ai ~ a 2 + a 2 1 


a l 

a 2 


= 0 . 


<*3 


Comparing entries, we obtain a j = a 2 = a 2 = 0. 

Thus the equation (*) has only the zero solution and hence the vectors u 1? u 2 , u 3 are linearly 
independent. 


Therefore, the set {u 1? u 2 , u 3 } is linearly independent spanning set for V, thus it is a basis for the 
subspace V. 

Since the basis consists of 3 vectors, the dimension of the subspace Lis 3. 




















In summary, we found a basis 



for the subspace V and the dimension of V is 3. 




A hyperplane in n-dimensional vector space M n is defined to be the set of vectors 


Xi 

X2 


satisfying the 


linear equation of the form 


a 1 x 1 + a 2 x 2 


Q'n'X'n — 


where ai, a 2 ,..., a n (at least one of di , a 2 ,..., a n is nonzero) and b are real numbers. Here at least one of 
a l 5 a 2 j • • • j a n is nonzero. Consider the hyperplane P in M n described by the linear equation 


a\X\ + a 2 x 2 


CL n X n — 0, 


where a \, a 2 ,..., a n are some fixed real numbers and not all of these are zero. 

(The constant term b is zero.) Then prove that the hyperplane P is a subspace of R n of dimension n — 1. 

Proof. 

The hyperplane is a subspace. 


In a set theoretical notation, the hyperplane is give by 



' X 1' 



< X = 

X 2 

6 R n 

a 1 x 1 +a 2 x 2 + - + a n x n = 0\ 


X n 




■ i 




The defining relation 


a 1 x 1 + a 2 x 2 + + a n x n = 0 


can be written as 


a x a 2 


= 0. 


Let A be the 1 x n matrix^ = [ a \ a 2 fl «]. 

Then the above equation is simply Ax = 0, and the hyperplane P is described by 


















P = {x e R n \Ax = 0}, 

and this is exactly the definition of the null space of A. Namely, we have 

p = Ma). 

Since every null space of a matrix is a subspace, it follows that the hyperplane P is a subspace of R”. 

The dimension of the hyperplane is n - 1. 

Because P = Af(A), the dimension of P is the nullity of the matrix A. 

Since not all of a? s are zero, the rank of the matrix A is 1. Then by the rank-nullity theorem, we have 

rank of A + nullity of A = n. 

Hence the nullity of A is n - 1. 


Let A be a real 7x3 matrix such that its null space is spanned by the vectors 


"1" 


"2" 


" 1 " 

2 

5 

1 

, and 

-1 

_ 0 _ 


_ 0 _ 


_ 0 _ 


Then find the rank of the matrix A. 

(Purdue University) 

Solution. 

We first determine the nullity of A and deduce the rank of A by the rank-nullity theorem. 
The null space Af(A) of the matrix A is spanned by 


r 


-2 ' 


1 1 1 

2 

? 

1 

, and 

-1 

0 


0 


0 


Let us find a basis of the null space Af(A) among these vectors. 

We use the “leading 1 method”. 

Form the matrix whose column vectors are these three vectors and we apply elementary row 
operations as follows. 


"1 

2 

1 1 

r 2 

- 2*1 

"1 2 

1 - 

i 

~~ 3 R 2 

2 

1 

-1 


-► 


o 

3 -3 

-► 

0 

0 

0 




0 0 

0 




■1 

2 

1 ‘ 

R 

1 - 2*2 

"1 0 

-1 ‘ 



0 

1 

1 



0 1 

1 



0 

0 

0 



■ 

o 

o 

0 


The last matrix is in reduced row echelon form and the first and second column contain the leading 
l’s. 























Therefore, the first two vectors 


"1 ■ 


'2 " 

2 


1 

0 


0 


form a basis of the null space Af(A). 

Hence the nullity, which is the dimension of Af(A), is 2. 


Since the size of the matrix A is 7 x 3, the rank-nullity theorem gives 

3 = nullity of ^4 + rank ofv4. 


Thus, the rank of A is 1. 


and consider the following subset V of the 2-dimensional vector space M 2 . 

V = {x e M 2 | Ax = 5x). 

(a) Prove that the subset V is a subspace of R 2 . 

(b) Find a basis for V and determine the dimension of V. 



Proof. 

(a) V is a subspace of R 2 


Note that the equality Ax = 5x can be written as 

(A - 51)x = 0, 

where / is the 2 x 2 matrix. 

If we define B = A - 51, then the subset V becomes 

V = {x £ R 2 \Bx = 0} 

= MB), 


the null space of the matrix B. 

Since any null space is a vector space, this shows that V is a subspace of R . 


(b) A basis for V and the dimension of V 


From part (a), we have obtained that 


V=N(B\ 


where 


B= A-51 = 


-1 

3 


1 

-3 


Thus, V is a set of solutions of the system Bx = 0. 










The augmented matrix for this system is reduced as follows. 


3-3 0 


From this we see that the general solution x = 


R2 + 3R\ 

"-I 

1 

0" 

-*1 

■1 

-1 

0" 


0 

0 

0 


0 

0 

0 


satisfies 


*1 = x 2 > 


and thus the general solution is 


Therefore we have 



' x 2 a 


- 

X = 

x 2 

= x 2 

1 


V= U e R^ 


X = Jo 


1 1 

1 


for any x 2 E R 


Span 


Thus the set 


n 

i 


is a spanning set of V and it is linearly independent as it consists of only a single vector. 
Therefore 


n i 

i 


is a basis for the subspace V, and thus the dimension of V is 1. 

(Recall that the dimension of a vector space is the number of vectors in a basis for the vector space.) 


Let If be a subset of R 4 consisting of vectors that are perpendicular to vectors a, b and c, where 



"1" 


'1' 


" o " 

a — 

0 

1 

, b = 

1 

0 

J c = 

i 

-l 


_0_ 


0 


. 0 . 


Namely, 

V = {x G R 4 | a T x — 0, b T x — 0, and c T x — 0}. 

(a) Prove that If is a subspace of R 4 . 

(b) Find a basis of V. 





























(c) Determine the dimension of V. 


Proof. 

(a) Prove that V is a subspace of R 4 . 
Observe that the conditions 


a T x = 0,b T x = 0, and c T x = 0 
can be combined into the following matrix equation 


Ax = 0 , 


where 


A = 


1 0 
1 1 
0 1 


1 0 

0 0 

-1 0 


and 0 is the three dimensional zero vector. 

a T i T T 

Note that the rows of the matrix A are a , b , and c . 

It follows that the subset V is the null space Af(A) of the matrix A. 

Being the null space, V — Af(A) is a subspace of R 4 . 

(See the post “The Null Space (the Kernel) of a Matrix is a Subspace of R n “.) 


(b) Find a basis of V. 


In the proof of Part (a), we saw that V — J\f (A). 

To find a basis, we determine the solutions of Ax. = 0. 

Applying elementary row operations to the augmented matrix, we see that 


1 

0 

1 

0 

0 

f?2 —R\ 
-» 

" 1 

0 

1 

0 

1 

1 

0 

0 

0 

0 

1 

-1 

0 

0 

_1 

1 

-1 

0 

0 


0 

1 

-1 

0 


i?3 — R 2 
- > 


1 0 
0 1 
0 0 


1 0 

-1 0 

0 0 


0 

0 

0 


It follows that the general solution is given by 

Xi = -xz,x 2 = £C 3 . 

The vector form solution is 



~ Xi 


"-£3" 


■-1' 


"o' 




£3 


1 

+ X4 

0 

X = 

£3 


X3 

= X3 

1 

0 



_X 4 _ 


X4 


. 0 . 


_i_ 


Hence we have 























V = N(A) 


/ 


'-1' 


"O' 

\ 

xeK 4 

X = Xs 

1 

1 

+ X 4 

0 

0 

, where E R 

l 


. 0 . 


_1_ 

J 




Then we just showed that B is a spanning set for V. 

It is straightforward to see that B is linearly independent. 

Hence B is a basis for V. 

(c) Determine the dimension of V. 

As the basis B for V that we obtained in Part (b) consists of two vectors, the dimension of the 
subspace V is dim(y) = 2. 


0 Find a basis for the subspace W of all vectors in M 4 which are perpendicular to the columns of the matrix 


11 

12 

13 

14 

21 

22 

23 

24 

31 

32 

33 

34 

41 

42 

43 

44 


(Harvard University Exam) 


Hint. 

O Show that W — Af(A T ). 


0 Find a basis of Af(A T ) by reducing the matrix A T . 


Solution. 

Let us write A — [A\ A 2 A% A 4 ] , where A{ is the i- th column vector of A for i — 1, 2, 3,4. 
First we claim that a vector x G R 4 is perpendicular to all column vectors A{ if and only if 

X E A r (^4 T ). 

To see this, we compute 
























\AJ] 





A^x. 


X = 

^3 X 

kJ 


.^4 X . 


From this equality the claim follows immediately. 

So we proved that J\f(A T ) — W. From this, we see that W is actually a subspace of R 4 . 


A T 1 

Thus, we need to find a basis for the null space of the transpose A . 

A T 1 

We apply elementary row operations to A and obtain a reduced row echelon form 


A 1 


1 0 -1 -2 

0 12 3 

0 0 0 0 

0 0 0 0 


The last two columns correspond to two free variables. Let s and t be free variable. 
'xi 


Thenx = 


X2 

X3 
X 4 _ 


G J\f(A T ) if and only if x satisfies 


X\ = s + 2t 
X 2 — —2 s — 3 1 
xs = s 
= t , 


equivalently 


' 1 ' 


1 

to 

-2 

+ t 

-3 

1 

0 

. 0 . 


. 1 . 


Therefore a basis of W = J\f(A T ) is 


' 1 " 


■ 2' 

-2 

and 

-3 

1 


0 

. 0 . 


. 1. 


€> Let V be the vector space over M of all real 2x2 matrices. Let W be the subset of V consisting of all 
symmetric matrices. 

(a) Prove that W is a subspace of V. 

(b) Find a basis of W. 

(c) Determine the dimension of W. 


















Proof. 

Recall that A is symmetric if A T — A. 

(a) Prove that W is a subspace of V. 
We verify the following subspace criteria: 


Subspace Criteria. 

The zero vector of V is in W. 

For any A,B G W, the sum A + B G W. 

^ For any A £ W and r G R, the scalar product r A G W. 


The zero vector in V is the 2 X 2 zero matrix O. 

It is clear that 0 T = O, and hence O is symmetric. 
Thus O G IV and condition 1 is met. 


Let A, B be arbitrary elements in W. 

That is, A and B are symmetric matrices. 

We show that the sum A + B is also symmetric. 
We have 


(A +B) T = A T + B T = A + B. 


The second equality follows as A, B are symmetric. 
Hence A + B is symmetric and 4 + BG W. 
Condition 2 is met. 


To check condition 3, let A G VT and r G R. 
We have 


(■ rA) T = rvl T = rA , 


where the second equality follows since ^4 is symmetric. 

This implies that r^4 is symmetric, and hence rA G W. 

So condition 3 is met, and we conclude that W is a subspace of V by subspace criteria. 


(b) Find a basis of W. 
Let 


A 


an 

a 12 

_a 2 i 

«22 


be an arbitrary element in the subspace W. 
Then since A T = A, we have 


an 

«21 


an 

<212 

_Ul 2 

«22 . 


.U 21 

<222 . 


This implies that a \2 = 0 , 21 , and hence 


<211 ai 2 
.<212 < 222 . 


= «n 


1 

0 


0 

0 


+ 012 


0 

1 


1 

0 


+ 022 


0 

0 


0 

1 


Let B — {vi , 2)2 , t >3 }, where V\ , v 2 , u 3 are 2 X 2 matrices appearing in the above linear 
combination of A. 

Note that these matrices are symmetric. 

Hence we showed that any element in W is a linear combination of matrices in B. 

Thus B is a spanning set for the subspace W. 


We show that B is linearly independent. 

Suppose that we have 

C\V\ + c 2 v 2 + C 3 U 3 = 

Then it follows that 


Cl 

C 2 _ 


'0 

o' 

_C 2 

C 3 _ 


.0 

0. 


0 0 
0 0 


Thus Ci = C 2 = C 3 = and the set B is linearly independent. 

As B is a linearly independent spanning set, we conclude that B is a basis for the subspace IV. 
(c) Determine the dimension of W. 

Recall that the dimension of a subspace is the number of vectors in a basis of the subspace. 

In part (b), we found that B — {t>i, v 2 , C 3 } is a basis for the subspace W. 

As B consists of three vectors, the dimension of W is 3. 

Related Question (Skew-Symmetric Matrices) 

A matrix A is called skew-symmetric if ,4 * = —A. 























Problem. 


Let V be the vector space of all 2 X 2 matrices. 

Let W be a subset of V consisting of all 2 X 2 skew-symmetric matrices. 
Prove that W is a subspace of V and also find a basis and dimension of W . 


Suppose that Vi, v 2 ,..., v r are linearly dependent n-dimensional real vectors. For any vector v r+ i E R n 
determine whether the vectors Vi, v 2 ,. . ., v r , v r+ i are linearly independent or linearly dependent. 

Solution. 

We claim that the vectors v 1? v 2 , .. v r , v r+ 1 are linearly dependent. 

Since the vectors v 1? v 2 , ..., v r are linearly dependent, there exist scalars (real numbers) a x , a 2 ,. . 
such that 


a l \ l +a 2 x 2 + - + a r x r = 0 

and not all of a x , ... 9 a r are zero, that is, (a 1? ..., a r ) ^ (0, ..., 0). 

Consider the equation 

*1 V 1 + *2 V 2 + - + X r y r + l V r + 1 = °- 

If this equation has a nonzero solution (jc 1? ..., x r , x r+ j), then the vectors v 1? . .., v r+ ^ are linearly 
dependent. 


In fact, 


(x 1? x 2 , ...,x r ,x r+1 ) = (a v a 2 , ... 9 a r , 0) 

is a nonzero solution of the above equation. 

To see this, first note that since not all of a x ,a 2 , ..., a r are zero, we have 

(«l,fl 2 , 0) ^ (0, o, ...,0, 0). 

Plug these values in the equation, we have 

^l v l + ^2 V 2 + *** + a r y r + 0v r+1 

= a l \ l + a 2 x 2 + ••• + a r x r = 0 by (*). 


Therefore, we conclude that the vectors v 1? v 2 , ..., v r , x r+1 are linearly dependent. 


Let B — { Vi , v 2 , V3 } be a set of three-dimensional vectors in M 3 . 

(a) Prove that if the set B is linearly independent, then B is a basis of the vector space R 3 . 

(b) Prove that if the set B spans R 3 , then B is a basis of R 3 . 




Definition (A Basis of a Subspace). 

A subset B of a vector space V is called a basis if B is linearly independent spanning set. 

Proof. 

(a) Prove that if the set B is linearly independent, then B is a basis of the vector space R 3 . 

To show that B is a basis, we need only prove that B is a spanning set of R 3 as we know that B is 
linearly independent. 

Let b e R 3 be an arbitrary vector. 

We prove that there exist x 1? x 2 , x 3 such that 

x l v l +X 2 V 2 +X 3 V 3 = b 



This is equivalent to having a solution x 


x 2 

* 3 


to the matrix equation 


Ax = b, 

where 

A = [v 1 ,v 2 ,v 3 ] 

is the 3 x 3 matrix whose column vectors are v 1? v 2 , v 3 . 

Since the vectors v 1? v 2 , v 3 are linearly independent, the matrix^ is nonsingular. 

It follows that the equation (*) has the unique solution x = A ~ 1 b. 

Hence b is a linear combination of the vectors in B. 

This means that B is a spanning set of R 3 , hence B is a basis. 

(b) Prove that if the set B spans R 3 , then B is a basis of R 3 . 

As we know that B spans R 3 , it suffices to show that B is linearly independent. 

Note that the assumption that B is a spanning set of R 3 means that any vector b in R 3 is a linear 
combination of v 1? v 2 , v 3 : there exist c 1? c 2 , c 3 such that 

c l y l +c 2 v 2 + c 3 v 3 = h - 

Equivalently, for any b 6 R 3 , the equation Ax = b has a solution. 


Letv4 = [v 1? v 2 , v 3 ] as before. 
It follows that the equation 


Ax 


■1 1 

0 

0 


has a solution x = u 1 . 
Similarly the equations 







'O' 


'O' 

Ax = 

1 

, Ax = 

0 


0 


1 


have solutions u 2 , u 3 , respectively. 


Define the 3x3 matrix A by 


Then it follows that 


A = [Uj,u 2 ,u 3 ]. 

AA =^4[u 1? u 2 , u 3 ] 

= [^u 1 ,^4u 2 ,^u 3 ] 

■1 0 O' 

= 010 . 

0 0 1 


As the identity matrix is nonsingular, the product AA is nonsingular. 

Thus, the matrix A is nonsingular as well. 

This implies that the column vectors of A are linearly independent. 

Hence the set B is linearly independent and we conclude that B is a basis of R 3 . 


Let V be a subspace of M n . Suppose that S — {vi, v 2 ,..., v m } is a spanning set for V. Prove that any set of 
ra + 1 or more vectors in V is linearly dependent. 

We give two proofs. The essential ideas behind those two proofs are the same. 

The second proofs uses matrix equations to organize clutter of a system of linear equations. 

Proof 1. 

Consider the set T = {u 1? u 2 , ..., u^} of k vectors in the subspace V, where k > m + 1 . 

We prove that the set T is linearly dependent. 

Consider the linear combination 


x 1 u l +x 2 u 2 + -+X k u k = 0, 

where 0 is the zero vector of R”. 

Our goal is to find a nonzero solution jc 1? x 2 , ..., x n of this equation. 


As S is a spanning set for V, each vector v. in T can be written as a linear combination of vectors in S. 
So we have 









Using these linear combination, the equation (*) can be written as 


x l( a ll v l + a 21 v 2 + •" + a m\ y m) + x 2( a 12 v l + a 22 v 2 + + a m2 v m) 

+ ••• + X k (a lk \ l + a 2k \ 2 + ■■■ + a mk \ m ) = 0 . 

Rearranging terms, we obtain 

( a \\ x \ +a u x 2 + - + a^ k )\ x + (a 2l x 1 + a 22 x 2 + - + a 2k x k )\ 2 
+ - + («„!*! + a m 2 x 2 + - + a mk x k) y m = 0 


Now it suffices to find a nonzero x 1? . . .x k such that the coefficients of v 1? ..\ m in (**) are all zero. 
Thus, we need to find a nonzero solution of the following system. 

a \\X\ + a 12 x 2 + ••• + a lk x k = 0 

a 2 iX i + a 22 x 2 + *** + a 2k x k = ^ 
a ml x l + a m2 x 2 + "* + Vr °- 

Observe that this is a m x k homogeneous system and as m + 1 < k, there are more variables than 
equations. 

Hence this homogeneous system has infinitely many solutions, in particular, it has a nonzero solution 

* 1 , 

Hence the equation (**) and, equivalently, the equation (*) has a nonzero solution jc 1? 

Therefore the set T is linearly dependent. This completes the proof. 

Proof 2. 

Let T = {u 1? u 2 , ..u^} be the set of k vectors in the subspace V, where k>m+ 1. 

We want to show that T is linearly dependent. 

This is equivalent to finding a nonzero solution x 1? ..., x k of the equation 

+ x 2^2 x k^k = 


The equation can be written as 


Ux = 0, 


where 


Note that U is an n x k matrix. 


Since the subspace V is spanned by the set S = {v 1? v 2 ,..v m }, we have 


111 ^11^1 ^21 V 2 


u 2 = ^i2 v l + a 22 y 2 + *' 

" + a m2 V m 

n k = a lk*l +a 2k y 2 + " 

a minin' 


Let V = [v 1? v 2 ,v w ] be the n x m matrix whose column vectors are v 1? ..., v m . 
Then the above linear combinations can be expressed as 


■ 

■_ 


■ 

■_ 


■_ 

a 2\ 

ii 

a 

= u 2 , V 

a 2k 

■ 


a n,2 

1 1 


s 

a 

_■ 


Combining these into one matrix equation, we obtain 

VA = U, 

where 

Every Basis of a Subspace Has the Same Number of Vectors 


a ii 

a \2 - 

- a \k 

a 2l 

a 22 

a 2k 

a m\ 

a m2 - 

■ a mk 


Thus, the equation (***) becomes 

VAx = 0. 


Note that A is an m x k matrix with k> m. 

So the homogeneous system Ax = 0 has a nonzero solution x = x 0 since there are more variables than 
equations. 

Hence we have VAx 0 = VO = 0, and the equation (***) has a nonzero solution x Q . 

Thus we conclude that the set T is linearly dependent, which completes the proof. 












Related Question. 

As an application of this problem, you may want to prove the following problem. 


Problem. 

Let Vbe a subspace of R n . 

Suppose that B = {v 1? v 2 ,..v^} is a basis of the subspace V. 
Prove that every basis of V consists of k vectors in V. 


Let V be a subspace of M n . Suppose that B = {vi, V 2 ,..., } is a basis of the subspace V. Prove that every 

basis of V consists of k vectors in V. 

Hint. 

You may use the following fact: 


Fact. 

If S = {v 1? ..., \ m } is a spanning set of a subspace V of R n , then any set of m + 1 or more vectors 
of V is linearly dependent. 


For a proof of this fact, see the post 1 

If there are More Vectors Than a Spanning Set, then Vectors are Linearly Dependent 

Proof. 

Let B = {w 1? w 2 , ..Wj} be an arbitrary basis of the subspace V. 

Our goal is to show that l = k. 


As B is a basis, it is a spanning set for V consisting of k vectors. 

By the fact stated above, a set of k + 1 or more vectors of V must be linearly dependent. 
Since B is a basis, it is linearly independent. 

It follows that / < k. 


We now change the roles of B and B . 

As B is a basis, it is a spanning set for V consisting of / vectors. 

So it follows from Fact that a set of / + 1 or more vectors must be linearly dependent. 
Since B is a basis, it is linearly independent. 

Hence k< 1. 


Therefore we have / < k and k < /, and it yields that l = k, as required. 


o Prove the followings. 

(a) Af(A) =J\f(A T A). 

(b) rank(vl) = rank(j4 T vl). 

Hint. 

For part (b), use the rank-nullity theorem and the result from (a). 

The rank-nullity theorem says that for a m x n matrix, 

rank of A + nullity of A = n. 

Proof. 

(a )Af(A) = M(A 1 A). 

Show Af(A) c MA r A) 

Consider any x E Af(A). Then we have Ax = 0. Multiplying it by A T from the left, we obtain 

A t Ax=A t 0 = 0. 

Thus x E Af(A T A), and hence Af(A) c Af(A T A). 


Show A f(A) 3 Af(A T A) 


On the other hand, let x 6 Af(A T A). Thus we have 

A t Ax = 0. 

Multiplying it by x T from the left, we obtain 

x t A t Ax = x T 0 = 0. 


This implies that we have 

0 = f4x) T 04x) = I \Ax\ I 2 

and the length of the vector Ax is zero, thus the vector Ax = 0. Hence x e Af(A), and we obtain 
Af(A) 3 Af(A r A). 

(b) rank(H) = rank (A T A) 

We use the rank-nullity theorem and obtain 

rank(^4) = n dim(Af(A)) = n - dim(A f{A T A)) = rank(^4 T ^4). 


(Note that the size of the matrix A T A is n x n.) 


0 Let dbeanmxn matrix and B be an n X l matrix. Then prove the followings, 
(a) va,nk(AB) < rank [A). 






(b) If the matrix B is nonsingular, then rank (AB) — rank (^4). 

Hint. 

The rank of an m x n matrix M is the dimension of the range 1Z(M) of the matrix M. 

The range of the matrix M is 

7 Z(M) = {y E R m | y = Mx for some x E R n }. 

Proof. 

(a) x‘&x\k(AB) < rank(v4). 

Recall that the rank of a matrix M is the dimension of the range 1Z(M) of the matrix M. 

So we have 

rankfTS) = dim(7 Z(AB)), rankf4) = dim(1Z(A)). 

In general, if a vector space V is a subset of a vector space W, then we have 

dim(F) < dim (W). 

Thus, it suffices to show that the vector space 7 Z(AB) is a subset of the vector space 11(A). 

Consider any vector y e 1Z(AB). Then there exists a vector x e R l such that y = (AB)x by the 
definition of the range. 

Let z = Bx E R n . 

Then we have 

y = A(Bx) = Az 

and thus the vector y is in 1Z(A). Thus 1Z(AB) is a subset of 1Z(A) and we have 
xdivk{AB) = 6ixn(1Z(AB)) < 6im(1Z(A)) = rankf4) 

as required. 

(b) If the matrix B is nonsingular, then rank(^Z?) = rank(^). 

Since the matrix B is nonsingular, it is invertible. Thus the inverse matrix B ~ 1 exists. We apply part 
(a) with the matrices AB and B ~ l , instead of A and B. Then we have 

rank((v4i?)i? -1 ) < rank (AB) 

from (a). 

Combining this with the result of (a), we have 

rank(yf) = xdLX\k((AB)B 1 ) < rank(^45) < rank(^4). 

Therefore all the inequalities are in fact equalities, and hence we have 

x&vkiAB) = rankf4). 


o Let V and W be subspaces of R n such that V D W — {0} and dim(H) + dim(fF) = n. 

(a) If v + w = 0, where v E V and w E W, then show that v = 0 and w = 0. 

(b) If B\ is a basis for the subspace V and B 2 is a basis for the subspace W, then show that the union B\ U B<i 
is a basis for R n . 

(c) If x is in R n , then show that x can be written in the form x = v + w, where v E V and w E W. 

(d) Show that the representation obtained in part (c) is unique. 

Hint. 

Try to show that the vector v is in both V and W, hence in V fl W. 

Proof. 

(a) Show that v = 0 and $\mathbf{w}=\mathbf{0} 

Suppose that we have v + w = 0, where v E V and w E W. Then we have v = - w. Note that the left 
hand side is in V and the right hand side is in W. Thus the vector v is in both V and W, that is 
vE V C\ W = {0}. Hence v = 0, and it follows that w is also the zero vector. 


(b) Show that the union B x U B 2 is a basis for R n 

Let B l = {v 1? v 2 ,v^} and B 2 = {w 1? w 2 , w^} be bases for V and W, respectively. 

Note that dim(F) = k, dim (W) = /, and k+ l = n. 

We claim that the set B x U B 2 is linearly independent. 

Suppose that we have a linear combination of vectors in B x U B 2 

c l v l + C 2 V 2 + - + c k y k + rf l w l + ^2 W 2 + *" + d l™l = °- 

Then observe that the vector v := c 1 v 1 + c 2 \ 2 H- 1 - c k \ k is in V and the vector 

vr=d 1^1 + d 2 vf 2 H-b djWi is in W. Since their sum v + w = 0, by part (a), we must have v = 0 and 

w = 0 . 

Since B l is a basis for V, the vectors v • are linearly independent. Thus 

0 = v = c 1 v 1 +c 2 v 2 + - + c jfc v it 
implies that the coefficients c { are all zero. 

Similarly, since B 2 is a basis for W, 

0 = w = d l Wj + d 2 w 2 H-b djWj 


implies that we have d t = 0. 

Therefore the coefficient of the linear combination (*) is all zero. Thus the vectors 
v 1? ..v k , w 1? ..w / are linearly independent. 

Since B l U B 2 consists of n linearly independent vectors in the n dimensional vector space R n , they 
must be a basis. 

(c) Show that x can be written in the form x = v + w 


For any vector x E R", since B ^ U B 2 is a basis we have 

x = CjVj + C 2 v 2 + - + C k y k + d x Wj + d 2 W 2 + - + d,yf, 

for some c 1? c k , d 1? 

Let 


and 


V = C l \ l + C 2 \ 2 + + C k \ k 


W = (ijWj + J 2 W 2 H -f ^/ w /- 


Then we have v E V,w E W, and x = v + w as required. 

(d) Show that the representation obtained in part (c) is unique. 

Suppose that we have two representations for x, that is, we have x = v + w in addition to the 
x = v + w, where v, v E V and w, w E W. 

Then we have v + w = x = v +w,or equivalently, 

v - V = W - w. 

Observe that the left hand side is in V and the right hand side is in W thus we have 

v-v ,w w G V n W = {0}. 

Thus we have v = v and w = w and we conclude that the representations are actually identical, 
hence we proved the uniqueness. 


(J) Let A be an m X n matrix. Prove that the rank of A is the same as the rank of the transpose matrix AA . 

Hint. 

Recall that the rank of a matrix A is the dimension of the range of A. 

The range of A is spanned by the column vectors of the matrix A. 

Proof. 

We write A = {a A) and let 


A,- = 


l 2 i 


be the /-th column vector of A for / = 1, 2, ..., n. 
Also let 





B- 


be the i -th column vector of A T for i = 1,2 , that is B t is the transpose of the i -th row vector of A 


Suppose that rank(vf) = dim(7£(^4 T )) = k and let {v 1? v 2 ,..v^} be a basis for the range 1Z{A T ). 
We write 


v i2 


for i = 1,2, k. 

T 

Then each column vector B z - of ^4 is a linear combination of {v 1? v 2 , ..v^}. Thus we have 

B i = c n v i + - + c uV 
B 2 = c 21 v 1 + "' +c 2k v lc 

B m c ml V l c mlt^k' 


More explicitly we have, 


' a ll ' 


■vn' 



'V*! 1 

a \2 

= c \\ 

Vi2 

+ •• 

'• + C U 

v k2 

a \n 

1 1 


v l» 

1 1 



v kn 

1 1 


'«21 ' 


■vn 1 


' v k\ ' 

a 22 

= C 21 

v 12 

+ - + c 2k 

v k2 

a 2n 

1 1 


v ln 

1 1 


v kn 

1 1 


f n 1 

a m\ 


,, i 

V 11 



'vh 1 

a m2 

= c m\ 

v 12 

+ " 

" + c mk 

v k2 

a mn 

■ i 


v l« 

1 1 



v kn 

1 1 
























Now, we look at the i -th entries for the above vectors and we have 


+ 

> 

II 

<r 

- +c lk v ki 

a 2i = c 21 v U + " 

" + c 2 k v ki 


a mi = C ml V li + - +C mk V ki 


We rewrite these as a vector equality and obtain 


' a \i ' 


f r 1 

c n 



t r 1 

C lk 

a 2i 

= V \i 

C 2l 

+ •• 

- +V H 

c 2k 

a ni 

1 1 


C m\ 

■ ■ 



■ 

ri 

3 


- v lz c l + '" + v kfih 


where we put 


c r 


c y 


c • 
mj 


for j = 1,2, k. 


This shows that any column vector A f of A is a linear combination of vectors c 1? .. c k . Therefore we 
have 


71(A) = Span{A 1? ...,A n } c Span{c 1? 

Since the dimension of a subspace is smaller than the dimension of a vector space containing it, we 
have 


rank(v4) = dim(7l(A) < dim(Span{c 1? ..., c^}) < k. 


Hence we obtain 


rank(^4) < rank(^4 T ). 


To achieve the opposite inequality, we repeat this argument using A T , and we obtain 

rank(^4 T ) < rank((^ T ) T ) = rank(^4) 

since we have (A T ) T = A. 










Therefore, we proved the required equality 


rank(v4) = rank(v4 T ). 









Subspaces in General Vector Spaces 


Definition 


Q If a nonempty subset W of a vector space V is itself a vector space, we call W a subspace in V . 


Summary 


(Subspace Criteria) A subset W of a vector space V is a subspace if and only if 

1. The zero vector in V is in W. 

2. For any vectors A, B £ VF, the addition 4 + B G W. 

3. For any vector A £ W and a scalar c, the scalar multiplication cA £ W\ 


Problems 

Each of the following sets are not a subspace of the specified vector space. For each set, give a reason why it is not 
a subspace. 

( 1 ) 


in the vector space ] 

( 2 ) 


in the vector space ] 

( 3 ) 


S i = < 


< 

"®l’ 

^2 

e M 3 

IV 

0 


_^ 3 _ 


V 


s 2 = < 

✓ 

~ X\ 

e M 3 

\ 

x\ — 4^2 + 5^3 = 2 > 



_^3_ 


> 


s* = 


x 

ly] 


y = x 


in the vector space IRl. 

( 4 ) Let P4 be the vector space of all polynomials of degree 4 or less with real coefficients. 


S 4 = {f(x) E P 4 | /(1) is an integer} 


in the vector space P4. 

( 5 ) 


S 5 = {f(x) E P 4 | /(l) is a rational number} 


in the vector space P4. 

(6) Let M 2x2 be the vector space of all 2 X 2 real matrices. 





























Sq — {A G. M 2x2 I det(A) ^ 0} 


in the vector space 71^2x2- 

(7) 


S? — {A E M2X2 I det(A) — 0} 


in the vector space M2 X 2 • 

(8) Let C[— 1,1] be the vector space of all real continuous functions defined on the interval [a, b ]. 

5 8 = {/(x)eC[-2,2]|/(-l)/(l) = 0} 

in the vector space C\— 2 , 2 ]. 

(9) 


Sq — {f(x) E C[— 1,1] | f(x) > 0 for all — 1 < x < 1} 
in the vector space C[— 1 , 1 ]. 

(10) Let C 2 [a, 6 ] be the vector space of all real-valued functions /(x) defined on [a, &], where /(cc), f'{x ), 
and f"{x) are continuous on [a, b\. Here f'{x ), f"{x) are the first and second derivative of /(#). 

5io = { f(x ) E C 2 [—1,1] | f"(x) + /(x) = sin(x) for all — 1 < x < 1} 

in the vector space C[— 1 , 1 ]. 

(11) Let be the set of real polynomials of degree exactly k, where k > 1 is an integer, in the vector space P&. 

(12) Let V be a vector space and W C V a vector subspace. Define the subset S 12 to be the complement of W, 

V\W={veV\v<£W}. 

Solution. 

Recall the following subspace criteria. 

A subset W of a vector space V over the scalar field if is a subspace of V if and only if the 
following three criteria are met. 


The subset W contains the zero vector of V . 


Q If u,v E W, then u + v E W . 


Q If u E W and a E K , then au E W. 


Thus, to prove a subset W is not a subspace, we just need to find a counterexample of any of the 
three criteria. 


Solution (1). Si = {x E M 3 | £1 > 0} 

The subset Si does not satisfy condition 3. For example, consider the vector 


Then since X\ — 1 > 0, the vector x G S \. Then consider the scalar product of x and the scalar 
— 1. Then we have 


(-1) - x = 



and the first entry is —1, hence — x is not in S\ . Thus S i does not satisfy condition 3 and it is not a 
subspace of R 3 . 

(You can check that conditions 1, 2 are met.) 


Solution (2). £2 = {x G M 3 | X\ — 4a; 2 + 5^3 =2} 
The zero vector of the vector space R 3 is 


0 = 


0 

0 

0 


Since the zero vector 0 does not satisfy the defining relation X\ — 4^2 + 5 x3 = 2, it is not in S 2 . 
Hence condition 1 is not met, hence S 2 is not a subspace of R 3 . 

(You can check that conditions 2, 3 are not met as well.) 


Solution (3). S3 = {x £ M 2 | y = x 2 } 

Consider vectors 


1 

1 


and 



These are vectors in S 3 since both vectors satisfy the defining relation y — x 


2 


However, their sum 


1 

1 



0 

1 


is not in S 3 since 1 7 ^ 0 2 . 

Hence condition 2 is not met, and thus S 3 is not a subspace of R . 
(You can check that condition 1 is fulfilled yet condition 3 is not met.) 


Solution (4). S4 = {f(x) G P 4 | /(1) is an integer} 

Consider the polynomial f(x) — x. Since the degree of f(x) is 1 and /(1) = 1 is an integer, it is 
in S 4 . Consider the scalar product of f(x) and the scalar 1/2 £ R. 

Then we evaluate the scalar product at x — 1 and we have 



















which is not an integer. 

Thus (1/2) /(x) is not in S 4 , hence condition 3 is not met. Thus S 4 is not a subspace of P 4 . 
(You can check that conditions 1, 2 are met.) 

Solution (5). £5 = { f{x ) G P 4 | /(1) is a rational number} 

Let f(x) — x. Then f(x) is a degree 1 polynomial and /(1) = 1 is a rational number. 
However, the scalar product y/2f{x) of f(x) and the scalar \/2 G R is not in S§ since 

V2/(l) = y/2, 

which is not a rational number. Hence condition 3 is not met and S$ is not a subspace of P 4 . 
(You can check that conditions 1, 2 are met.) 


Solution ( 6 ). Sq = {A G M^x 2 | det(-A) 7 ^ 0} 

The zero vector of the vector space M^x2 is the 2 X 2 zero matrix O. 

Since the determinant of the zero matrix O is 0, it is not in Sq . Thus, condition 1 is not met and 5^6 is 
not a subspace of • 

(You can check that conditions 2, 3 are not met as well.) 


Solution (7). S 7 = {A G \ det(-A) = 0} 

Consider the matrices 



'i 

o' 

and B — 

i 

o 

o 

A = 



0 

0 


0 1 


The determinants of A and B are both 0, hence they belong to S 7 . 
However, their sum 


A + B = 


1 

0 


0 

1 


has the determinant 1 , hence the sum A + B is not in S 7 . 
So condition 2 is not met and S 7 is not a subspace of M^x2 • 
(You can check that conditions 1, 3 are met.) 


Solution ( 8 ). Ss = {f{x) G C[- 2,2] | /(-1)/(1) = 0} 

Consider the continuous functions 


f(x) = x — 1 and g(x) — x + 1 . 








(These are polynomials, hence they are continuous.) 

We have 

/(-l)/(l) = (-2).(0) = 0and 
= ( 0 ) • 2 = 0 . 


So these functions are in S $. 

However, their sum h(x ) := f(x) + g(x) does not belong to S$ since we have 

= (/(-1) +»(-!)) (/(I) + 9(1)) 

= (-2 + 0)(0 + 2) = -4 ^ 0. 


Therefore, condition 2 is not met and S$ is not a subspace of C[— 1,1]. 
(You can check that conditions 1, 3 are met.) 


Solution (9). Sq = { f(x ) G C[— 1,1] | f{x) > 0 for all — 1 < x < 1} 

Let f(x) — x 2 , an open-up parabola. 

Then f(x) is continuous and non-negative for — 1 < x < 1. Hence f(x) = x 2 is in Sg . 
However, the scalar product ( —1 )f(x) of f(x) and the scalar —1 is not in Sg since, say, 

(- 1 )/( 1 ) = -1 


is negative. 

So condition 3 is not met and Sg is not a subspace of C[— 1,1]. 
(You can check that conditions 1, 2 are met.) 


Solution (10). 

Sio = {/(#) G C 2 [— 1,1] | f"(x) + f(x) = sin(x) for all — 1 < x < 1} 

The zero vector of the vector space C 2 [—1,1] is the zero function 9(x) — 0. 

The second derivative of the zero function is still the zero function. 

Thus, 

9"(x) + 9(x) — 0 

and since sin(x) is not the zero function, 9(x) is not in Sig. 

Hence S io is not a subspace of C 2 [—1,1]. 

(You can check that conditions 2, 3 are not met as well. 

For example, consider the function f(x) — —^xcos(x) G S io.) 

Solution (11). Let Sn be the set of real polynomials of degree exactly k. 

The set Sn is not a vector subspace of P&. One reason is that the zero function 0 has degree 0, and 
so does not lie in Sn . The set Sn is also not closed under addition. Consider the two polynomials 



f(x) — x k + 1 and g(x) =-x k + 1. Both of these polynomials lie in Sn , however 
f(x) + g{x) — 2 has degree 0 and so does not lie in Sn. 

Solution (12). The complement 

The complement S \2 — V \ W is not a vector subspace. Specifically, if 0 G V is the zero vector, 
then we know 0 G W because W is a sub space. But then 0 ^ V \ W, and so V \ W cannot be a 
vector subspace. 


Let V be the vector space over R of all real valued functions defined on the interval [0,1]. Determine whether the 
following subsets of V are subspaces or not. 

(a) S = {f(x) e V | /( 0) = /( 1)}. 

(b) T = {f(x) e V | /(0) = /(l) + 3}. 


Hint. 

To show that a subset FT of a vector space V is a subspace, we need to check that 
the zero vector in V is in IF 

Q for any two vectors IF , we have u + v G W 

Q for any scalar c and any vector nG W ,w e have cu G W . 


Solution. 


(a) Is 5 = {/(x) G H |/(0) = /(!)} a subspace? 


We show that S is a subspace of the vector space V by checking conditions (l)-(3) given in the hint above. 
First note that the zero vector in V is the zero function 0{x) , that is, 9{x ) = 0 for any x G [0,1]. 

Since we have 0(0) = 0 = S(l), the zero function 9(x ) G S . 

Condition (1) is met. 


Now, take any f{x ), g"(x) G S .By the defining relation of S , we have 

/(0) =/(!)> g(0)=g(l). 

Consider the addition (f + g) (x) . We have 

(f+^)(0) =/(0) +g(0) =/(l) +g(l) = (/ + g)(i) 

and it follows that (f + g) (x) G S . 

Thus S satisfies condition (2). 





To check the condition (3), take any scalar cGK and f(x) E S . 

Since f(x) E S , we have /( 0) — f(l) . The scalar multiplication (cf)(x) satisfies 

te0(0) = c •/(0) = c •/( 1) = (c/)(0). 

Thus (c/) (x) G 5 . 

Therefore, the subset S satisfies conditions (l)-(3). Hence S is a subspace of V . 

(b) Is T = {f{x) e V 1/(0) = /(l) + 3} a subspace? 

We claim that T is not a subspace of the vector space V. 

For example, the subset T does not satisfy condition (1). 

The zero vector of V is the zero function 6{x) . 

Then we have 


0(0) = 0^0 + 3 = 9(1) + 3, 


and hence the zero vector 9(x) E V is not in W . 


^ Let V be the vector space over R of all real valued function on the interval [0,1] and let 

W = {f(x) E V | f(x) = f( 1 - x) for x E [0,1]} 
be a subset of V. Determine whether the subset W is a subspace of the vector space V. 


Proof. 

We claim that IF is a subspace of V. 

To show the claim, we need to check that the following subspace criteria. 


Subspace Criteria. 

The zero vector in V is in W . 


^ For any two elements f(x) , g(x) E W 


we have f(x) + g(x) E W . 


0 For any scalar c and any element f(x) E W , we have cf(x) E W 


The zero vector of V is the zero function 9(x) — 0 . 
Since we have 



9{x) — 0 = 9{1 — x) 

for any xG [0,1], the zero vector 9 is in W , hence condition 1 is met. 

Let f(x) , g(x) be arbitrary elements in W . Then these functions satisfy 

fix) =/( 1-x) and g(x) = g(l - x) 

for any x E [0,1] . 

We want to show that the sum h(x ) := f(x) + g(x) is in W . This follows since we have 
h(x) = /(x) + g(x) =/( 1 - x) + g(l - x) = A(l' - x). 

Thus, condition 2 is satisfied. 


Finally, we check condition 3. Let c be a scalar and let f[x) be an element in W . 
Then we have 

fix) =/(i -x). 


It follows from this that 


c/(x) = c/(l - x), 

and this shows that the scalar product cf(x) is in W . 

Therefore condition 3 holds, and we have proved the subspace criteria for W . Thus W is a subspace of the 
vector space V . 

0 Let C[— 1,1] be the vector space over R of all real-valued continuous functions defined on the interval [—1,1]. 
Consider the subset F of C[— 1,1] defined by 

F = {f(x) e C[- 1,1] | /(0) is an integer}. 

Prove or disprove that F is a subspace of C[— 1,1]. 


Proof. 

(a) Continuous functions taking integer values at 0. 

We claim that the subset F is not a subspace of the vector space C[— 1,1] . 

If T 7 is a subspace, then F is closed under scalar multiplication. 

For instance, consider the scalar multiplication of the function f{x) — x + 1 and the scalar r — 1/2. 
Note that since/(0) = 1 is an integer, the function is in F . 

However, the scalar multiplication 

r A x ) = 1 (x +1) 

is not in F because 



r A o) = \ 

is not an integer. 

Therefore, F is not closed under scalar multiplication, hence F is not a subspace of C[— 1,1] . 

(b) Set of all n X n skew-symmetric matrices 
We prove that W is a subspace of V by showing the following subspace criteria. 

G The zero vector 0 G M nxn is in W . 

G Ifzi,5 € , then^ + B G W . 

0 If A G W, r G R , then rAeW. 

First, note that the zero vector in the vector space M nxn is the n X n zero matrix O. Since we have 

0 T = o = -o, 

the zero matrix O is skew-symmetric, hence the zero vector O is in W . So condition 1 is met. 

To prove condition 2 , let A, B be arbitrary elements in W . Thus, A and B are skew-symmetric matrices: 

A t = -AandB T = -B. (*) 

We want to prove that the addition A + B is in W , namely, we want to show A + B is skew-symmetric. 
We have 

(A + B) t = A T + B t = -A + ( -B ) = -(A+ B ), 

and this implies thatv4 + B is skew-symmetric. Hence A + B G W and condition 2 is satisfied. 

Finally, we check condition 3. Let A G W and rG R. We want to show that the scalar product 
rA G W , that is, /v4 is skew-symmetric. 

We have 

(r^4) T = rA T 

= r(—A) (A is skew-symmetric) 

= — 

which yields that rA is skew-symmetric. Hence rA G W and condition 3 is met as well. 

Thus, we have checked all the subspace criteria. Hence W is a subspace of M nxn . 

G Let P 3 be the vector space over M of all degree three or less polynomial with real number coefficient. Let W be 
the following subset of P 3 . 


W — {p(x) E P3 | p (— 1 ) — 0 andp"(l) — 0}. 

Here p' ( x ) is the first derivative of p(x) and p" (x) is the second derivative of p(x). Show that W is a subspace 
of P3 and find a basis for W. 


Proof. 

Subspace criteria 

To show that the subset W of the vector space P 3 is a subspace, we need to check that 
O the zero vector in P 3 is in W 

€> for any two vectors W , we have u + v £ W 




for any scalar c and any vector wG W ,wq have cu £ W . 


Check Condition 1 

First note that the zero vector in P 3 is the zero polynomial, which we denote 9{x ) . 

Thus we have #(x) = 0 for any x. 

The derivative of the zero function is still the zero function, we have 

6 f (x) = 0, and 9"(x) = 0. 

Therefore the zero vector 9{x) satisfies the following defining relations for W 

9f{- 1) = Oand 0"( 1) = 0 

and thus the zero vector 9 {x) is in W . 

Thus condition 1 is met. 

Check Condition 2 

To check condition 2, let/(x),g-(x) E W . 

Then/(x) and g-(x) satisfy 

/'(-l) = 0,/"(l) = 0, andg'(-l) = 0,g"(l) = 0. 

Let h(x) f(x) + g'(x) . We want to show that the sum h(x ) = f{x) + g"(x) is in W . 
We have 

A'W = {fix) +g(x))' =f'{x)+g'{x) 





and similarly we have 

h"(x) =/"(*)+g"(x). 

Thus we obtain 

h\- 1) = /'(-1) + g'(-l) = 0 + 0 = 0 and h"{ 1) =/"( 1) + g"(l) = 0 + 0 = 0. 

Therefore h(x) satisfies the defining relations of W and hence h{x) — f(x) + g(x) E W . 

So condition 2 is satisfied. 

Check Condition 3 


Condition 3 can be checked as follows. Let f{x) E W and let r E R . 
We want to show that the scalar product k{x ) := //(x) is in W . 

Since/(x) is in W , we have 

/'(—i) = o,/"(i) = o. 


Note that we have 


k'(x) = ( rf{x ))' = r/'(x), andA:"(x) = rf"{x). 


Thus we see that 

k\— 1) = rf\— 1) = r • 0 = 0 and k n { 1) = rf"( 1) = r • 0 = 0. 

Thus £(x) satisfies the defining relations of W and hence the scalar product k{x ) = rf(x) is in W . 
Therefore condition 3 and thus all conditions are met, and we conclude that W is a subspace of P 3 . 

Find a basis for the subspace W . 

To find a basis, we observe the following. 

Any vector in W is a polynomial 

p(x) — ao + a\x + CI2X 2 + a^x 3 


satisfying 


p{— 1) = 0 and p"{ 1) = 0. 


Since we have 

^'(x) = a\ + 2^2X + Za^x 2 and p" (x) = 2^2 + 6^3X5 
the above conditions become 


a\ — 2^2 “b 3^3 — 0 and 2(22 “b 6(23 — 0. 
To find solutions for these equations consider the augmented matrix 


1 

-2 

3 

1- 

O 

0 

2 

6 

-1 

O 






We apply the elementary row operations to this matrix and obtain the following reduced row echelon 
matrix. 


1 

0 

9 

1 - 

o 

1 - 

o 

i 

3 

- 1 

o 


Thus, solutions are 

CL\ — — 9(23 
(22 = 3(23 • 


Therefore, any polynomial p{x) in W can be written as 

p(x) — (2o — 9 ( 23 X — 3 ( 23 X + a^x 

— cio (1) T - ( 23 (—9x — 3x 2 T x^). 

In particular we see that the polynomials qi (x) := 1 and q 2 (x) := — 9x — 3x 2 + x 3 are in W . 
Since any vector p(x) £ W is a linear combination of qi (x) and q 2 (x) , the set { q\ (x), q 2 (x) } is a 
spanning set for W . 


We check that q\ (x) and q 2 (x) are linearly independent. 
If we have a linear combination 


c\qi (x) + c 2 ®(i) = 0 (= 0(x)), 


then we have 


9 o 

Cl — 9C2X — 3C2X + C2X = 0. 

Thus we see that C\ — C 2 — 0 and the vectors q\ (x) and q 2 (x) are linearly independent. 

Therefore (x), q 2 (x) } is a linearly independent spanning set for W , and thus it is a basis for W . 


In summary we found a basis 


{1, — 9x — 3x 2 + x 3 }. 


Remark: The dimension of W is 2. 


O Let V be the vector space over M consisting of all n X n real matrices for some fixed integer n. Prove or 
disprove that the following subsets of V are subspaces of V. 

(a) The set S consisting of all n X n symmetric matrices. 

(b) The set T consisting of all n X n skew-symmetric matrices. 

(c) The set U consisting of all n X n nonsingular matrices. 

Hint. 

Recall that 

^ a matrix A is symmetric if A T = A. 





a matrix A is skew-symmetric if A T = -A. 


Proof. 

To show that a subset FT of a vector space V is a subspace, we need to check that 


the zero vector in V is in W 


for any two vectors u,v E W, we have u + v E W 


for any scalar c and any vector u E W, we have cu E W. 


(a) The set S consisting of all n x n symmetric matrices. 

We will prove that S is a sub space of V. The zero vector O in V is the n x n zero matrix and it is 
symmetric. Thus the zero vector O E S and the condition 1 is met. 


To check the second condition, take any A, B E S, that is, A , B are symmetric matrices. 

To show that A + B E S, we need to check that the matrix A + B is symmetric. 

We have 

+ B) T = A T + B T = A + B 

since A, B are symmetric. Thus A + B is also symmetric, and A + B E S. Condition 2 is also satisfied. 


Finally, to check condition 3, let A E S and let r E R. We show that rA E S , namely, we show that rA 
is symmetric. 

We have 


(rA) T = rA T = rA 

since A is symmetric. Thus rA is symmetric and hence rA E S. 

Thus condition 3 is met. 

By the subspace criteria, the subset S is a subspace of the vector space V. 
(b) The set T consisting of all n x n skew-symmetric matrices. 




We will prove that T is a subspace of V. 

The zero vector O in V is the n x n matrix, and it is skew-symmetric because 

o T = o = - a 


Thus condition 1 is met. 


For condition 2, take arbitrary elements A,BeT. The matrices A, B are skew-symmetric, namely, we 
have 

A t = - A and B T = - B. 

We show that A + B E T 7 , or equivalently we show that the matrix A + B is skew-symmetric. 

We have 

* 

04 + B) 1 = + 5 t = A l ( - /i) -» ~(A+B). 

Therefore the matrix A + B is skew-symmetric and condition 2 is met. 


To prove the last condition, consider any A E T and r E R. 

We show that rA is skew-symmetric, and hence rA E T. 

Using the fact that A is skew-symmetric (A = - A), we have 

(rA) T = rA T = r( - A) = -r,4. 

Hence rv4 is skew-symmetric and condition 3 is satisfied. 

By the subspace criteria, the subset T is a subspace of the vector space V. 

(c) The set U consisting of all n x n nonsingular matrices. 

We claim that U is not a subspace of V. 

As the zero vector of V is the n * n matrix and the zero matrix is singular, the zero vector is not in U. 
Hence condition 1 is not met, and thus U is not a subspace. 


Another reason that U is not a subspace is that the addition is not closed. For example, 

if A is a nonsingular matrix (say, A is n x n identity matrix), then -A is also nonsingular matrix but 

their addition A+ (-A) = O is nonsingular, hence it is not in U. 


$ Let V be the vector space of n X n matrices with real coefficients, and define 

W — {v E V | vw = wv for all w £ V}. 
The set W is called the center of V. Prove that W is a subspace of V. 


Proof. 

We must show that W satisfies the three criteria for vector subspaces. 

Namely, the zero vector of V is in W , and W is closed under addition and scalar multiplication. 

First, the zero element in V is the matrix 0 whose entries are all 0. For any other matrix xG V , we have 
Ox = 0 = xO . So we see that 0 E W . 


Now suppose v, w E W and c E R. Then for any x E V , we have 

(v + w)x = vx + wx = xv + xw = x(v + w), 

where the second equality follows because v and w lie in W . So we see that v + w £ W as well, and 
so W is closed under addition. 


Finally we must show that cv E W as well. For any other xG V , we have 

(cv)x = c(vx) = c(xv) = x(cv), 

where the second equality follows from the fact that v G W and so vx = xv . 

Thus we see that cv E W , finishing the proof. 

Let C(R) be the vector space of real-valued functions on R. Consider the set of functions 
W — {f(x) — a + bcos(x) + ccos(2cc) | a, 6, c G R}. Prove that W is a vector subspace of C(R). 

Proof. 

We verify the subspace criteria: the zero vector of C(M) is in W, and W is closed under addition 
and scalar multiplication. 

First, the zero element of C(M) is the zero function 0 defined by 0(x) — 0. This element lies in 
W, as 0 (x) = 0 + 0 cos(a:) + 0 cos( 2 x). 

Now suppose fi(x), f 2 (x) E W, say fi(x) — a\ + b\ cos(x) + C\ cos(2x) and 
f 2 {x) — ct 2 + &2 cos(a:) + C 2 cos(2x). Then 

fi(x) + f 2 (x) = (ai + a 2 ) + ( 6 i + b 2 )cos(x) + {c\ + c 2 )cos( 2 x) 

and so fi (x) + / 2 (x) E W. Finally, for any scalar d E R, we have 

df\{x) = (aid) + ( 6 id)cos(#) + (cid) cos( 2 x), 

and so dfi ( x ) E W as well. 

This proves that W is a subspace of W. 



0 


For what real values of a is the set 


Wa = {/ G C(R) I /(0) = a} 


a subspace of the vector space C (R) of all real-valued functions? 


Solution. 

The zero element of C(R) is the function 0 defined by 0(x) = 0 . This shows that 0 £ W a if and only 
if a = 0. 


We have shown that if a 0, then is not a subspace as every subspace contains the zero vector. Now 
we consider the case a = 0 and prove that W o is a subspace. 


We verify the subspace criteria: the zero vector of C(R) is in Wo, and W o is closed under addition and 
scalar multiplication. 

As mentioned before, 0 G Wq . 


Now suppose/,gG fFo • Then/(0) = g(0) = 0 , and so 

(/+g)(0 )=/(0)+g(0 ) = 0. 

Thus/+ g G fFo • Finally, if c G R is a scalar and f £ W o , then 

(c/)(0)=c/(0)=c-0 = 0. 

Thus cf £ W o , and JT^o is a vector subspace. 

Let V be the vector space of n X n matrices, and M £ V a fixed matrix. Define 

W = {A £ V | AM = MA}. 

The set W here is called the centralizer of M in V. Prove that W is a subspace of V. 


Proof. 

First we check that the zero element of V lies in W . The zero element of V is the n X n zero matrix 0 . 
It is clear that MO = 0 = 0M , and so 0 G W . 

Next suppose A, B £ W and c £ R. Then AM — MA and BM — MB , and so 
(A + B)M = AM + BM = MA + MB = M (A + B). 

Thus, A + B £ W . 





We also have 


(cA)M = c(AM) = c(MA) = M(cA ), 


and so cA E W . 

These three criteria show that FT is a subspace of V . 


Let y be the vector space of k X k matrices. Then for fixed matrices R, S E V , define the subset 
FF = {iL4S | A E y}. Prove that FF is a vector subspace of y. 

Proof. 

We verify the subspace criteria: the zero vector of V is in FF, and W is closed under addition and 
scalar multiplication. 


First, let 0 E V be the k X k zero matrix. Then ROS — 0, and so 0 E W. 


Now suppose X,Y E W. Then there are elements A, B E V such that RAS — X and 
RBS = Y. Then 


X + Y = RAS + RBS = R(A + B)S 


and soX + Y E W. 


Now for a scalar cEl and matrix X = RAS E W, we have 

cX = cRAS = R(cA)S, 


and so cX E W as well. 


Let y denote the vector space of all real n X n matrices, where n is a positive integer. Determine whether the set 
U of all n X n nilpotent matrices is a subspace of the vector space V or not. 


Solution. 

We claim that the matrix B — A is not necessarily invertible. 
Consider the matrix 


A = 


0 

0 



This matrix is nilpotent as we have 






1 

1—1 

1 

o 


1 

o 

1 

I— 1 
_1 


1 

o 

o 
_1 

1 

o 

o 
_1 


1 

o 

o 

1_ 


1 

o 

o 

1_ 


Also consider the matrix 


B = 


1 

1 


0 

1 


Since the determinant of the matrix B is 1, it is invertible. 


So the matrix A and B satisfy the assumption of the problem. 
However the matrix 


is not invertible as its determinant is 0. 

Hence we found a counterexample. 

Related Question. 

Here is another problem about a nilpotent matrix. 


Problem. 

Let A be an n X n nilpotent matrix. Then prove that / — A, I + A are both nonsingular matrices, 
where / is the n X n identity matrix. 


Let P be a vector space over a scalar field K . 

Let V]_ , V 2 ,..., v#. be vectors in P and consider the subset 

W {aivi + a 2 v 2 H-b a k w k | a 2 ,..., a k £ K and a\ + a 2 H-b a k = 0}. 

So each element of W is a linear combination of vectors Vi ,..., v& such that the sum of the coefficients is zero. 
Prove that W is a subspace of V. 

Proof 1. (Subspace Criteria) 


We use the following sub space criteria. 

The subset VP is a subspace of V if the following three conditions are met. 


The zero vector in V is in VP. 


For any two elements v, v r E VP, we have v + v' £ VP. 


For any scalar c £ K and any element v £ VP, we have cv £ VP. 












The zero vector 0 of V can be written as 

0 = Ovi + 0v 2 H-h 0v fc . 

Clearly the sum of the coefficient is zero, hence 0 £ W. 

So condition 1 is met. 


To verify condition 2, let 


v = aivi + a 2 v 2 H-h d k w k 


and 

v' = feiVi + b 2 v 2 H-h b k v k 

be arbitrary elements in W. Thus 

&i + a 2 + • • • + a& = 0 and &i + & 2 + • • • + b k — 0 . (*) 

The sum v + v r is 

v + v' = (aivi + a 2 v 2 H-b a k v k ) + (&1V1 + & 2 v 2 H-b b k v k ) 

= («i + &i)vi + (a 2 + & 2 )v 2 H- b (a* + 


The the sum of the coefficients of the above linear combination is 

(ai + b\) + (a 2 + b 2 ) + • • • + ( d k + b k ) 

— (&i + ft 2 + * * * + d k ) + ( b\ + b 2 + • • • + b k ) — 0 + 0 = 0. 


It follows that the sum v + v r is in W, and hence condition 2 is met. 


Finally, let us check condition 3. Let 

v = aivi + a 2 v 2 H-h 

be an arbitrary vector in W and let c £ if. 

Since v £ W*, we have 

&1 + n 2 + • • • + d k = 0. 


Then the scalar product is 

CV = c(oiVi + a 2 v 2 H-h afeVfe) 

= caivi + ca 2 v 2 H-h ca^Vfe. 


The sum of the coefficients of the linear combination is 

cd\ + c< 2 2 + • • • + cd k = c(< 2 i + d 2 + • • • + n&) = n0 = 0. 


Hence cv G V, and condition 3 is met. 

Therefore by the subspace criteria, we conclude that W is a subspace of V. 

Proof 2. (Span) 

Consider an arbitrary vector in W : 

aivi + a 2 v 2 H-b a&v& with ai + a 2 H-+ a*; = 0. 

Substituting the relation ak — — (cti + a 2 + • • • + a&_i), we obtain 
aivi + a 2 v 2 H-b a&_iv&_i + a&v fc 

= aivi + a 2 v 2 H-+ afc-iv fc _i - (ai + a 2 H-b ak- i)v fc 

= ai(v! - v*.) + a 2 (v 2 - v fc ) + ■ • • + - v fc _i). 

This computation yields that every vector in W is a linear combination of vectors in 
S := (vi - v fc ,v 2 - Vfc,...,v fe _i - v fc _i}. 

That is, we have W C Span (S'). 


On the other hand, let 

v = Ci(vi - Vfe) + C 2 (v 2 — Vfe) H-b Cfc_i(v fe _i - Vfe_i) 

be an arbitrary vector in Span(S'). 

Then we have 

V = C1V1 + c 2 v 2 H- b Ck-i-Vk -1 - (Cl + C 2 H- b Cfe_i)v fe . 

This is a linear combination of Vi , V2 ,. . ., v^, and the sum of the coefficients is 
Cl + C 2 + * • * + C k -l ~ (Ci + C 2 + * * * + C k - 1) = 0. 
Therefore v E VT. Thus we also have Span(S') C VT. 


Putting together these inclusion yields that W — Span(iS). 

As the span is always a subspace, we conclude that W is a subspace of V. 


O 


Suppose that S is a fixed invertible 3 by 3 matrix. This question is about all the matrices A that are diagonalized 
by S so that S' -1 AS is diagonal. Show that these matrices A form a subspace of 3 by 3 matrix space. 


(MIT) 




Check the following criteria for a subset to be a subspace. 

Theorem. ( Subspace criteria ) 

A subset W of a vector space V is a subspace if and only if 
The zero vector in V isinJV. 

^ For any vectors A, B £ W , the addition A + B £ W . 

0 For any vector^ £ W and a scalar c, the scalar multiplication cA £ W . 


Proof. 

Let V be the vector space of all 3 by 3 matrices. 

Define the subset of V 

W — {A £ V | S~ l AS is diagonal.}. 

We want to show that W is a subspace of V . 

To do this, it suffices to show the following subspace criteria; 

0 The zero vector in V is in W . 

0 For any vectors A, B £ W , the addition A + B £ W . 

0 For any vector A £ fF and a scalar c, the scalar multiplication cA £ W . 


The zero vector in the vector space V is the 3 by 3 zero matrix O . 
Since S ~ l OS = O , we have O £ W . 


To show the second criterion, take A , 5 £ JF . 

Then we have 5 _1 (A + 5)5 = 5 _1 AS + S _1 55 and since A, B £ W , the matrices 
S -1 v45, S ~ l BS are diagonal. 

The addition of diagonal matrices is still diagonal, thus 5 _ 1 (^4+5)5 is diagonal andv4 + B £ W . 




We finally check the third criterion. 

Take A E W and let c be a scalar. 

Then we have S -1 ( cA)S — cS ~ l AS . The matrix S ~ l AS is diagonal since A £ W . 

Since a scalar multiplication of a diagonal matrix is still diagonal, we conclude that the matrix S -1 ( cA)S 
is diagonal and cA E W . 

Therefore the subset W of V satisfies the criteria to be a subspace. 

© Let P 2 be the vector space of polynomials of degree 2 or less. Prove that the set {1,1 + x, (1 + x) 2 } is a basis 
for P 2 . 


Proof. 

(a) Prove that the set {1, 1 + x, (1 + x ) 2 } is a basis for P 2 . 

Consider the standard basis B = of P 2 . Using this basis, we can write the elements using 

coordinate vectors as 


[1]b = 

" 1 " 

0 

[l + x] B = 

" 1 " 

1 

[(1 + *) 2 ]b = 

'1 

2 


0 


0 


1 


We find the coordinate vector by writing an element as a linear combination of the basis elements. For 
example, {1 + x) 2 = 1 + 2x + lx 2 , and so the coefficients 1, 2,1 translate into the column vector 

" 1 " 

2 . 

1 


Because dim P 2 = 3, this set is a basis if and only if these three vectors are linearly independent. To 
verify this, consider the matrix 

"1 1 1 " 

0 12 . 

_0 0 1 _ 

This matrix is upper-triangular, and the diagonal entries are all non-zero. This implies the matrix is non¬ 
singular, and so the columns are linearly independent. 

Thus, the set {1,1 + x, (1 + x) 2 } is a basis of P 2 . 

(b) Write the polynomial f(x) = 2 + 3x— x 2 as a linear combination of the basis 

{M + *, (1 + *) 2 }- 

First Method: Using the technique of completing the square, we can factor the polynomial f(x) as we like. 
Specifically, 













f(x ) = — x 2 + 3x + 2 

=— ■ (a? + l) 2 + 5a? 3 

——{x + l) 2 + 5(a? + 1)—2. 

Hence, we have the linear combination 

f(x) = -2 • 1 + 5(1 + x) - (1 + xf. (*) 


Second Method: We can find this factorization by calculating the Taylor polynomial of /(a?) centered at 
- 1 . 

This Taylor polynomial is defined by 

/(*) = /(- 1 ) + /'(- 1 )(* + 1 ) + + 1 ) 2 - 

The polynomial in (*) is recovered by finding /( —1) = — 2, f f ( — 1) = 5, and/ // ( —1) = —2. 


^ Let V be a real vector space of all real sequences 

MZi = ( a i5 a 2?'' *)• 

Let U be the subset of V defined by 

U — {( a i)^l ^ ^ I a k +2 — § a k +1 + 3 a k = 0 , k = 1 , 2 , . . .}. 
Prove that U is a subspace of V. 


Proof. 

We prove the subspace criteria. 


Subspace criteria 

The zero vector in V is in U. 


For any two elements (a* )^ 1 , (6/ )^ 1 E £/, we have (a* )^ 1 + (Z? z - ) z ^ 1 E £/. 


© For any scalar c and any element (a z - )^ 1 E C/, we have c(a z - )^ 1 E £/. 


Condition 1. 




The zero vector in V is the zero sequence (0) = (0,0,0,...). Clearly, this sequence satisfies the 
recurrence relation cik+2 ~ 1 + 3ak — 0. Thus the zero vector (0) G U, hence condition 1 is met. 


Condition 2. 


To check condition 2, take (a z - )^ 1 , (bj )™ 1 G U. 
We want to show that the sum 


(«/)£ 1 + &)% 1 = («* + H=1 


is also in U. 


Since (a/ )^ 1 , (Z?/ )^ 1 G U, these sequences satisfy the recurrence relation 

a k +2 ~ 5a>t+i + 3a^ = 0 (*) 


and 


Z >&+2 — + 3 Z ?£ — 0 


for k — 1, 2 ,... . 

Using these two relations, we have 

(tf k+2 + ^+ 2 ) — 1 + + 3(a^ + 

— (^+2 — 5a^ + i + 3a^) + (bk+ 2 — 5Z?^ + i + 3Z?^) 

= 0 + 0 = 0 . 

Thus the sum (a/)^ 1 + (Z?/)^ 1 = (a z - + satisfies the recurrence relation and it is also in U. 

Hence condition 2 is met. 


Condition 3. 

To check condition 3, take (a* )^ 1 G £/ and let c G R be a scalar. 

Then the sequence (a* )+ 1 satisfies (*). Multiplying (*) by c, we have 

( ca k + 2 ) - 5(ca*+i) + 3 (ca k ) = 0. 

This implies that the scalar product c(a ; )^ 1 = (ca ; )^ 1 satisfies the recurrence relation, and hence it is 
in £/. Thus condition 3 is satisfied. 

We have checked all subspace criteria, and thus U is a subspace of the vector space V . 

0 Let V be a vector space over a field K . If W\ and W 2 are subspaces of V, then prove that the subset 

W 1 + W 2 := {x + y |xe Wi,ye W 2 } 


is a subspace of the vector space V. 

Proof. 

We prove the following subspace criteria: 

^ The zero vector 0 of V is in W\ + W2 . 


^ For any u, v £ W\ + W2 , we have u + v £ W\ + W2 . 


0 For any v £ Wi + W2 and r £ if, we have rv £ Wi + W2. 


Since W\ and W 2 are subspaces of V, the zero vector 0 of V is in both W\ and W 2 . 
Thus we have 


0 = 0 + 0 eWi + w 2 . 


So condition 1 is met. 


Next, let u, v £ W\ + W 2 . 

Since u £ W\ + W 2 , we can write 

u = x + y 

for some x £ W\ and y £ W 2 . 

Similarly, we write 

f 1 f 

v = x + y 

for some x.' £ VFi and y' £ W 2 • 

Then we have 

u + V = (x + y) + (x' + y') 

= ( x + x') + (y + y')« 

Since x and x' are both in the vector space W\ , their sum x + x r is also in W\ . 
Similarly we have y + y r £ Vl ^2 since y, y f £ 

Thus from the expression above, we see that 

u + v £ W\ + W 2 , 


hence condition 2 is met. 


Finally, let v E W\ + W 2 and r E K. 

Then there exist x E W\ and y E W 2 such that 

v = x + y. 

Since W\ is a subspace, it is closed under scalar multiplication. Hence we have rx E W \. 
Similarly, we have ry E W 2 . 

It follows from this observation that 

rv = r(x + y) 

= rx + ry E W\ + W 2 , 


and thus condition 3 is met. 


Therefore, by the subspace criteria Wi + W 2 is a subspace of V. 
Related Question. 


Let U and V be finite dimensional subspaces in a vector space over a scalar field K. 
Then prove that 

dim (U + V) < dim(?7) + dim(Tf). 


For a proof, see the post “Dimension of the sum of two subspaces“. 



Linearly Independency of General Vectors 


Definition 


Let V be a vector space over a scalar field K. Let = {vi ,. . ., V&} be a set of vectors in V. 
We say that S is linearly independent if whenever c\ Vi + • • • + C&V& = Owe have 

ci = • • • = c k = 0. 


Problems 

Let C [3,10] be the vector space consisting of all continuous functions defined on the interval [3,10]. Consider 
the set S — {y/ 3C , X 2 } in (7[3,10]. Show that the set S is linearly independent in C [3,10]. 


Proof. 

Note that the zero vector in C [3,10] is the zero function 9{x) — 0 . 
Let us consider a linear combination 


aiy/ic + a^x 1 — 9{x) — 0. 

We want to show that ai — <22 — 0. 

Since this equality holds for any value ofx between 3 and 10, letting x = 4 andx = 9 yields the system 
of linear equations 

2ai + 4^2 = 0 
3(2i + 81(22 — 0. 

Solving this system, we obtain a\ — =0, and hence the set S is linearly independent. 

Comment. 

We could have taken any two values between 3 and 10 for x instead of 4 and 9, but the choice x = 4 and 
x = 9 made solving the system easier. 

Also, we took two values for x because we had two unknowns a\ , , and thus we needed two equations 

to determine these unknown. 

For a similar problem, show that the set { y/x , x 2 , x} in the vector space C[l, 10] is linearly 
independent. 

In this case, there will be three unknowns that you want to show to be zero. 

So you need to take three values for x. 

A natural choice will be x = 1,4, 9 . 

Let C[— 7T, 7 r] be the vector space of all continuous functions defined on the interval [—7T, 7r] . Show that the 
subset {cos(cc), sin(cc)} in C\— 7T, 7r] is linearly independent. 













Proof. 

Note that the zero vector in the vector space C[—n, n\ is the zero function 

9{x) 0. 

Let us consider a linear combination 

a\ cos(x) + U 2 sin(x) = 9(x ) = 0. (*) 

If this linear combination has only the zero solution a\ — a 2 — 0, then the set {cos(x), sin(x)} is 
linearly independent. 


The equality (*) should be true for any values of x G [—7T, n\ . 

Setting x = 0, we obtain from (*) that 

^i=0 

since cos(0) = l,sin(0) = 0. 

We also set x = 7 t/ 2 and we obtain 

a 2 = 0 

since cos(7t/2) = 0,sin(7r/2) = 1 . 

Therefore, we have a\ = (12 = 0 and we conclude that the set {cos(x), sin(x)} is linearly 
independent. 


Let C[— 27T, 27t] be the vector space of all continuous functions defined on the interval [— 27T, 27t] . Consider the 
functions f(x) = sin 2 (cc) and g(x) — cos 2 (x) in C[— 27T, 27r]. Prove or disprove that the functions f(x ) and 
g{x) are linearly independent. 

(The Ohio State University) 


Proof. 

To determine whether f(x) and g(x) are linearly independent or not, consider the linear combination 

C\f{x) + c 2 g(x ) = 0, 


equivalently 


ci sin 2 (x) + C 2 cos 2 (x) = 0, 


n 


where C\ , C 2 are scalars. 

If the only scalars satisfying the above equality are c\ — 0, C 2 — 0 , then /(x) and g-(x) are linearly 
independent, otherwise they are linearly dependent. 




Note that this is an equality as functions. 

That is, this equality must hold for any x in the interval [—27T, 27 t] . 

Letx = 0 . Then as sin(O) = 0 and cos(O) = 1, we obtain C 2 — 0 from (*). 

Next, letx = 7r/ 2 . Then as sin(7r/2) = 1 and cos(7t/2) = 0 , we obtain c\ — 0 from (*). 

Therefore, we must have c\ = — 0 , and hence the functions f{x) — sin 2 (x) and g(x) = cos 2 (x) 

are linearly independent. 

Comment. 

This is one of the midterm 2 exam problems for Linear Algebra (Math 2568) in Autumn 2017. 

Here is the most common mistake. 

The linear combination Ci sin 2 (x) + C 2 cos 2 (x) is a function defined over the interval [—27T, 27 t] and 
we are assuming it is the zero function. 

So saying that “if Ci = 1, C 2 = 0 , then Ci sin 2 (x) + C 2 cos 2 (x) is zero at x = 0 , hence/(x) and 
g’(x) are linearly independent” is totally wrong. 

What you are claiming here is that the function sin 2 (x) is zero at x = 0 , hence it is the zero function. 
This is clearly wrong as sin 2 (x) is not the zero function. 

Let f{x ) = sin 2 {x),g{x) — cos 2 (x), and h{x) — 1. These are vectors in C[— 1,1], Determine whether the 
set {/(#), g(x ), h(x)} is linearly dependent or linearly independent. 

(The Ohio State University) 


Solution. 

We claim that the set is linearly dependent. To show the claim, we need to find nontrivial scalars Ci, C 2 , C 3 
such that 

c lf( x ) + c 2 g(x) + C 3 h(x) = 0 . 

From trigonometry, we know the identity 

sin 2 (x) + cos 2 (x) = 1. 

This implies that we have 

sin 2 (x) + cos 2 (x) — 1 = 0. 

So we can choose Ci = 1, C2 = 1, C 3 = — 1 , and thus the set is linearly dependent. 

Let C \, C 2 ,. . ., c n be mutually distinct real numbers. Show that exponential functions 

l x ^ g c n x 


are linearly independent over R. 




o 


Consider a linear combination 


aie ClX + a 2 e C2X H-h a n e c " x — 0. 


Q Differentiate this equality « — 1 times and you will get n equations. 


Write a matrix equation for the system. You will see the Vandermonde matrix. 


Proof. 

Suppose that we have a linear combination of these functions that is zero. 
Namely, suppose we have 


a \e ClX + ci 2 e C2A + 


c 2 x 


a n e 


= 0 


for some real numbers a\ , CI 2 ,.. •, CL n . 

We want to show that the coefficients a \, CL 2 ,..., a n are all zero. 


By differentiating the equation, we obtain 


a\C\e ClX + a 2 c 2 e C2X + • • • + a n c n e CnX — 0. 
Differentiating repeatedly we further obtain the equalities 

a\c\e ClX + ci 2 c\e C2X + • • • + a n c\e CnX = 0 

CL\£^C ClX Cl2C^6 C2X + • • • + d n C^6 CnX — 0 

+ a 2 c n 2 ~ 1 e C2X + • • • + a n c n ~ l e CnX = 0 

We rewrite these n equations into the following matrix equation. 


1 

1 

1 




Cl 

Cl 

• • Cn 


a\e ClX 


"O' 

cl 

to to 

r 2 

• • n 


a 2 e C2X 






_a n e d ' x _ 


.0. 







cl 1 

ct l • 

n-1 

. . u n 





The determinant of the left matrix is 












det 


1 

c\ 


L C 1 


n —1 


1 

C2 


W— 1 


by the Vandermonde determinant. 


1 

Cn 


= 11(9 ~ c i) 

i<j 


1 


Since by assumption C \,. . ., c n are distinct, the determinant is not zero. 
Therefore by multiplying equality (*) by the inverse on the left, we obtain 


aie ClX 


"o' 

a,2e C2X 

— 

d 1 x 

la n e cx J 


.0. 


Since the functions e CiX are always positive, we must have a\ — a 2 — • • • = a n — 0 as required. 
Therefore the functions e ClX ,..., e CnX are linearly independent. 


Comment. 

The determinant that we considered above is called the Wronskian for the set of functions 
{e ClX ,e C2X ,...,e c " x }. 

Related Question. 

The following problems are more concrete versions of the current problem. 


Problem. 

Let C[— 1,1] be the vector space over R of all continuous functions defined on the interval [—1,1]. 
Let 

V := {/(x) G C[ —1,1] | /(x) = ue x + be* x + ce^ x ^ci^b^c G M} 
be a subset in C[— 1,1] . 

(a) Prove that V is a subspace of C[— 1,1] . 

(b) Prove that the set 5 = {e x , e 2x , e 3x } is a basis of C. 

(c) Prove that 

B ' = {e x - 2e 3x ,e x +^ 2x + 2e 3x ,3e 2x + e 3x } 

is a basis for V. 


See the post 1 , 

Exponential Functions Form a Basis of a Vector Space 
for the solution. 








By calculating the Wronskian, determine whether the set of exponential functions 

{e x ,e 2x ,e 3x } 

is linearly independent on the interval [—1,1]. 


The solutions is given in the post ^ 

Using the Wronskian for Exponential Functions, Determine Whether the Set is Linearly Independent 


Let V be an n-dimensional vector space over a field K. 

Suppose that Vi, v 2 ,.. ., v*. are linearly independent vectors in V. Are the following vectors linearly 
independent? 


Vl+V 2 , V2+V3, Vfc_i+V fc , Vfc+Vi. 

If it is linearly dependent, give a non-trivial linear combination of these vectors summing up to the zero vector. 



0 


Consider a linar combination of these vectors summing up to the 


zero vector. 


0 Check whether the coefficients are zero or not. 


0 To do this, form a matrix equation. 


Solution. 

Consider a linear combination 

Ci(vi +v 2 ) + C 2 (v 2 + v 3 ) H- + V k ) + Clc(yic + Vi) = 0. 

Collecting similar terms, we obtain 

(ci + C*)v 1 + (ci + C 2 )V 2 H-1- (Ck -2 + Ck-l)vk -1 + {Ck -1 + Ck)Vk = 0 . 

Since Vi, v 2 ,..., v* are linearly independent, we have 

Ci + Ck = 0 

ci + c 2 = 0 

Ck—2 + Ck-1 — 0 
Ck -1 + Ck — 0 

This is equivalent to the following matrix equality 




"1 

0 

0 ... 

0 

0 

r 



1 

1 

0 ... 

0 

0 

0 


Cl 

0 

1 

1 ... 

0 

0 

0 


C2 

0 

0 

0 ... 

1 

1 

0 


C/c—l 

. c k _ 

.0 

0 

0 ... 

0 

1 

1 . 



We reduce the coefficient matrix using elementary row operations and obtain the matrix 


'1 

0 

0 ... 

0 

0 

1 

0 

1 

0 ... 

0 

0 

-1 

0 

0 

1 ... 

0 

0 

1 

• • O 

0 

0 ... 

0 

1 

. . T-1 

.0 

0 

0 ... 

0 

0 

1 -(-!)*._ 


Thus if k is odd, then the system has only the trivial solution q = 0 . 

On the other hand, if k is even, then the matrix is singular. Hence we have a non-trivial solution. 

For example, letting ck — 1, we see that 

Cl — — 1,C2 = 1,C3 = —1,C4 = 1, . . . ,Ck-l = l,Ck = 1 
is a non-trivial solution. 

Thus we have when k is even 

-(vi + V 2 ) + (v 2 + v 3 )-h (vfc -1 + V*) - (vk + Vl) = 0. 

Comment. 

We used elementary row operations in the solution because we wanted to have a nontrivial linear 
combination. 

From the computation, the determinant of the coefficient matrix can be computed easily. 

The other method to calculate the determinant is to use the cofactor expansion. See problem Calculate 
determinants of matrices for details. 


0 Let V be a vector space over a scalar field K. Let S — {vi , V 2 ,..., v n } be the set of vectors in V, where 

n > 2. Then prove that the set S is linearly dependent if and only if at least one of the vectors in S can be written 
as a linear combination of remaining vectors in S. 


Proof. 

( =r- ): If.S' is linearly dependent, then a vector is a linear combination 















Suppose that the set S — {vi , v 2 , . . ., } is linearly dependent. Then there exists scalars 

C 1 ? c 2 ? • • • 5 c n in K such that 


C 1 V 1 + c 2 v 2 H-h C n v n = 0 


and not all of c \,..., c n are zero. 

(This is just the definition of linearly dependency.) 

Since at least one of C\ ,. . ., c n is nonzero, without loss of generality we may assume that C\ ^ 0 . 
Then we have 


c lVl = -C 2 V 2 - C3V3- C n Y n 


and we divide this by C\ 7 ^ 0 and obtain 


C 2 c 3 c n 

vi =- V 2 -V 3 - *-V„. 

Cl Cl Cl 


Therefore, we have written the vector v 1 as a linear combination ofv 2 ,V3,...,v w . 

): If a vector is a linear combination, then S is linearly dependent 

Suppose now that some vector of S is a linear combination of the remaining vectors. 

Without loss of generality, we may assume that v 1 is a linear combination of the remaning vectors. So we 
have 



vi = c 2 v 2 + C3V3 H - b c n v n , 

for some scalars c 2 , C3 ,..., c n in K . 

This implies that we have 


Vi - c 2 v 2 - C3V3 - c„v n = 0, 

and this is a nontrivial linear combination of vectors in S because the coefficient of vi is 1 ^ 0. 
Thus, the set S is linearly dependent. 


Let V be a vector space over a field K. Let Ui, u 2 ,..., u n be linearly independent vectors in V. Let V be the 
subspace of V spanned by these vectors, that is, U — Spanjui, u 2 ,..., u n }. Let u n+ i E V. Show that 
Ui, u 2 ,. . ., u n , u n+ i are linearly independent if and only if u n+ i ^ U. 


Proof. 

(=0 

Suppose that the vectors Ui, u 2 ,..., u w , u n +i are linearly independent. If u n +i E U , then u n +i is a 
linear combination of m, u 2 ,. . ., M n . 

Thus, we have 


u n+1 = ciui + c 2 u 2 H-h c n u n 



for some scalars C\ , C2, ..., c n G K . 

However, this implies that we have a nontrivial linear combination 

ciui + c 2 u 2 H-b c n M n - u„+i = 0. 

This contradicts that Ui, U2,. . ., , u^ + i are linearly independent. Hence u^ + i ^ U. 


( <= ) Suppose now that u^+i ^ U. 

If the vectors Ui, U2,..., u n , u w+ i are linearly dependent, then there exists Ci, c 2 ..., c n , c w +i G AT 
such that 

not all of them are zero and 


ciui + c 2 u 2 H-b c n w. n + c„ + iu„ + i = 0. 

We claim that c n +\ 7^ 0 . If c n +\ = 0 , then we have 

C 1 U 1 + c 2 u 2 H-b C n VL n = 0 

and since Ui, U 2 ,. . ., u w are linearly independent, we must have Ci = C 2 = • • • = c n — 0 . This 
means that all c* are zero but this contradicts our choice of C \. Thus c n + 1 7^ 0 . 

Then we have 


-c\ ~c 2 ~c n 

1 = -ui + -u 2 H-b -u„. 

Cn +1 C n +1 C n +1 

(Note: we needed to check c n +\ 7^ 0 to divide by it.) 

This implies that u^+i is a linear combination of vectors Ui, 112, . . . , U n , and thus u^+i G U, a 
contradiction. 

Therefore, the vectors Ui, U2,. . ., u w , u^+i are linearly independent. 

By calculating the Wronskian, determine whether the set of exponential functions {e x , e 2x , e 3x } is linearly 
independent on the interval [—1,1]. 


Solution. 

The Wronskian for the set {e x , e 2x , e 3x } is given by 


W(x) := 


e x 

e 2x 

e 3x 

e x 

2e 2x 

3e 3x 

e x 

\e 2x 

9e 3x 


We compute 


W(x) = e x e 2x e 3x 


1 

1 

1 


1 

2 

4 


1 

3 

9 


= 2e 


6 x 









Since the Wronskian W (x) = 2e Qx is never zero, we conclude that the set { e x , e 2x , e 3x } is linearly 
independent. 


(Note that in general, we just need to show that W(x o) 7^ 0 for some point x 0 G [ 0 , 1 ]. 

For example, since we have W(0) = 2 7^ 0 , we can conclude that { e x , e 2x , e 3x } is linearly 
independent.) 

Related Question. 

Try the following generalized version. 

Let c \, C2 ,. . ., c n be mutually distinct real numbers. 

Show that functions 

e ClX ,e C2X ,... ,e CnX 

are linearly independent over M. 

The solution is given by Exponential Functions are Linearly Independent. 


Bases and Coordinate Vectors 


Definition 


Let V be a vector space over a scalar field K. 


O 

o 


A basis B for V is a linearly independent spanning set for V. 

Suppose that B = {vi,. . ., v n } is a basis for V. Let v £ V and write it as 
v = Ci Vi + • • • + c n v n , where Ci,.. ., c n £ K. Then the coordinate vector of v with respect to 
the basis B is 


Ms = 


ci 


Summary 


Let V be a vector space over a scalar field K. Suppose that B = {vi,..., v n } is a basis for V. Let 
S = {wi,.. ., w&} be a set of vectors in V . Let T — {[wi]#,. . ., [w&]#} be the set of the coordinate 
vectors of S . 

Any vector v can be uniquely written as v = Ci Vi + • • • + c n v n , where Ci,.. ., c n £ K. 

The dimension of the coordinate vector [v] b is the dimension of the vector space V. 

The set S is linearly independent if and only if T is linearly independent. 

Q S is a basis for Span(S') if and only if T is a basis for Span(T) in M n . 


Problems 

Let B — { Vi , V2 , V3 } be a basis for a vector space V over a scalar field K. Then show that any vector v £ V 
can be written uniquely as 


V = CiVi +C 2 V 2 +C3V3, 


where ci, C 2 , C 3 are scalars. 


Proof. 

Since B is a basis for V, any vector v £ V is a linear combination of basis vectors in B. 
Thus, there exist scalars C\ , c 2 , C3 £ K such that 



















V = CiV + C 2 V 2 + C3V3. 

Hence such an expression as a linear combination of basis vectors exists. 

We now show that such representation of v is unique. 

Suppose we have another representaion 

v = div + d 2 v 2 + d 3 v 3 

for some scalars d \, d 2 , (I3 £ K. 

Then we have 

CiV + c 2 v 2 + C3V3 = v = diV + d 2 v 2 + d 3 v 3 . 


Thus, we obtain 

(ci - di)v+(c 2 - C? 2 )v2 + (c 3 - d 3 )v 3 = 0. 
Since B is a basis, the vectors Vi, v 2 , V3 are linearly independent. 


Hence the coefficients of the above linear combination must be all zero, and thus we obtain 

C\ — d\ — 0, c 2 - d 2 = 0, C3 — g?3 = 0, 

equivalently, we have 


Ci = d\ , c 2 = d 2 , cs — ds. 

Therefore two representations of the vector v are the same, and thus the representation of v as a linear 
combination of basis vectors Vi, v 2 , V3 is unique. 


O 


Show that the set 


S — {1,1 — x, 3 + 4x + x 2 } 


is a basis of the vector space P 2 °f all polynomials of degree 2 or less. 


Proof. 


We know that the set B = { 1 , x, x 2 } is a basis for the vector space P 2 . 
With respect to this basis B , the coordinate vectors of vectors in S are 



'1" 


' 1 " 


'3' 

[l]s = 

1 

0 0 

, [1 - x]b — 

-1 

. 0 . 

j [3 + 4x + x^]b — 










Recall the fact that the set S is a basis for P 2 if and only if the coordinate vectors 
T — {[ 1 ]#, [1 — x]b, [3 + Ax + x 2 ] b } is a basis fori? 3 . 

Hence we check that T is a basis for i ? 3 . Note that T is a linearly independent set. 
In fact, if we have 


then we have 



"l" 


■ 1 " 


‘ 3 ‘ 

CLl 

0 

+ 0,2 

-1 

+ a 3 

4 


0. 


. 0 . 


1 


CLi + (12 + 3(23 

—a 2 + 4 a 3 

and we obtain a\ — a 2 = a 3 = 0 , and thus T is a linearly independent set. 

Since T consists of three linearly independent vectors in R and the dimension of R is 3 , the set T must 
be a basis for R 3 . 

Therefore, the set S' is a basis for P2. 


0 

0 

0 


O 


Let V be the vector space of all 2 x 2 matrices, and let the subset S of V be defined by 
S = {Ai,A 2 ,A 3 ,A 4 }, where 


7 
6 

Find a basis of the span Span(S) consisting of vectors in S and find the dimension of Span(S'). 



Proof. 


Let B — {£11 ? Ei2, E21, E22 } be the standard basis for the vector space V, where 


Eh 


1 0 
0 0 


jEi2 = 


0 1 
0 0 


) E 21 


0 0 
1 0 


> E22 = 


0 0 
0 1 


With respect to the basis B, we find the coordinate vectors for A \, A2, A3 , A\ as follows. 
Since we have 

A\ = Eh + 2 Ei 2 — E21 + 3E22, 


the coordinate vector for A 


l is 






























Similarly, we have 


' 0 ' 


‘ -1 ' 


‘ 3 ' 

-1 

II 

cq 

CO 

0 

, [■ A^b = 

7 

1 

1 

-2 

. 4 . 


.- 10 . 


. 6 . 


To find a basis for Span^) among vectors in S , we first find a basis for Span(T) among vectors 


T = {[Ai]b, [ A 2 ]b, [Az]b, [^4.4]i?}- 


Let form a matrix whose columns are vectors in T. That is, 


10-13 
2-107 
-11 1 -2 
. 3 4 -10 6 . 

We apply the elementary row operations as follows and obtain a reduced row echelon form matrix. 


1 

0 

-1 

3 ' 

R 2 —2R\ 

"1 

0 

-1 

3 " 



'1 

0 

-1 

3 





i?3 +i?i 





R 3 +R 2 





2 

-1 

0 

7 

i?4 — 3i?i 

0 

-1 

2 

1 

i?4 +4i?2 

0 

-1 

2 

1 





- * 






- > 





-1 

1 

1 

-2 


0 

1 

0 

1 



0 

0 

2 

2 

3 

4 

-7 

6 . 


.0 

4 

-7 

-3. 



.0 

0 

1 

1 





R\ -I--R 4 

'1 

0 

0 

4 ' 



'1 0 

0 

4 ' 





i ?2 — 2 i ?4 





1R 2 









i ?3 — 2 i ?4 

0 

-1 

0 - 

-1 

R 3 ^aR^. 

0 1 

0 

1 






— > 
















0 

0 

0 

0 



0 0 

1 

1 







.0 

0 

1 

1 . 



O 

O 

0 

0. 


The the first three columns of the reduced row echelon form contains the leading 1 ’s. 
Thus by, what I call, the leading 1 method, it follows that 


{[4i]b, [A2]b, [ 43 ]#} 

is a basis for Span(T). 

Therefore, by the correspondence of coordinate vectors, we obtain that 

{Ai,A 2 ,A z } 

is a basis of Span(S'). 

Since the basis {^ 1 ,^ 2 , A% } consists of three vectors, the dimension of Span(iS) is 3. 


in 


























Let P 2 be the vector space of all polynomials of degree 2 or less with real coefficients. Let 

S — { l + x + 2x 2 , x + 2x 2 , —1, x 2 } 

be the set of four vectors in P 2 . Then find a basis of the subspace Span(S) among the vectors in S. 
(The Ohio State University) 


Solution. 

Let B — {1, x, x 2 } be the standard basis of the vector space P 2 . 
With respect to the basis B , the coordinate vectors of vectors in S are 


[1 + x + 2 x 2 ]b — 

’i" 

1 

, [x + 2 x 2 ]b — 

'o' 

1 

, [~1]b = 

"-I" 

0 

, [x 2 ]b = 

'O' 

0 


2 


2 


0 


1 


T — { [1 + x + 2 x 2 ]b, [x + 2 x 2 ]b , [— 1 ]bj [ x<2 ]b } 

be the set of these coordinate vectors. 

We then find a basis of Span(T) among vectors in T by the leading 1 method. 

We reduce the augmented matrix by elementary row operations as follows. We have 


'1 

0 

-1 

O' 

rO 

GO 

to 

'1 

0 

-1 

0 " 

i ?2 —Til 
-> 

'1 

0 

-1 

o' 

1 

1 

0 

0 

1 

1 

0 

0 

0 

1 

1 

0 

.2 

2 

0 

1 . 


.0 

0 

0 

1 . 


.0 

0 

0 

1 . 


The last matrix is in reduced row echelon form and the first, the second, and the fourth columns contain the 
leading l’s. 

Therefore, it follows that 

{ [1 + X + 2 x 2 ]b, [x + 2 x 2 ]b, [x 2 ]b } 
is a basis of Span(T) , and hence 

{l + x + 2x 2 , x + 2x 2 , x 2 } 

is a basis of Span (S') consisting of the vectors of S. 


^ Let P 2 be the vector space of all polynomials of degree two or less. 

Consider the subset in P 2 

Q = {Pl (x),P2(x),P3(x),P4(x)}, 


where 


Pi (x) = x 2 + 2x + 1, p 2 (x) = 2x 2 + 3x + 1, 
Ps(x) — 2x 2 , Pa{x) — 2x 2 + x + 1. 


(a) Use the basis B — {1, x, x 2 } of P 2 , give the coordinate vectors of the vectors in Q. 



















(b) Find a basis of the span Span((2) consisting of vectors in Q. 

(c) For each vector in Q which is not a basis vector you obtained in (b), express the vector as a linear combination 
of basis vectors. 


Solution. 

Coordinate vectors 

(a) With respect to the given basis B = {1, x, x 2 }, the coordinate vectors are 



v 


'1" 

\pi{x)\ B = 

2 

, [P2(z)]b = 

3 


_1_ 


2 


"o' 


'1 

[P3(*)]b = 

0 

, [P4(®)]b = 

1 


2 


2. 


For example, to obtain [ps(x)]b, we express pz{x) as a linear combination of vectors in B and we get 

Ps(x) = 0 - l + 0 - £ + 2 -£ 2 . 

Note that the order of the sum in the linear combination is the order of basis vectors in B — {1, x, x 2 }. 
Then we read the coefficients of this linear combination and obtain the coordinate vectors 


[Ps(x)]b = 


0 

0 

2 


The rest are the same. 


Find a basis of the span 
(b) Let 


T : = 


{[pi(®)]s, [P 2 (x)]b, [ps(x)}b, \pa(x)\b} 



Then a basis of Span(Q) consisting of vectors in Q corresponds to a basis of Span(T) consisting of 
vectors in T. 

Consider the matrix whose columns are the vectors in T. We reduce it by elementary row operations as 
follows. 





















'1 

1 

0 

1 

i ?2 —2Ri 

'1 

1 

0 

1 ' 

2 

3 

0 

1 

- > 

0 

1 

0 

-1 

.1 

2 

2 

2. 

R s —Ri 

.0 

1 

2 

1 . 


'1 

0 

0 

2 " 


'1 

0 

0 

2 " 

0 

1 

0 

-1 

- > 

0 

1 

0 

-1 

.0 

0 

2 

2 . 


.0 

0 

1 

1 . 


The last matrix is in reduced row echelon form and the first three columns contain the leading 1 ’s. 
Therefore the set consisting of the first three vectors in T 


is a basis for Span(T). 

Therefore by the coordinate vector correspondence, it follows that the set 


1 


1 


0 

2 

5 

3 

5 

0 

1 


2 


2 


{Pl(x),P2(x),Pz(x)} 

is a basis for Span(Q) consisting of vectors in Q . 

Linear combination of basis vectors 

(c) From part (b), the only vector in Q which is not a basis vector is Pa(x). 

Thus we want to express p^(x) as a linear combination ofpi(x),p2(x),ps(x). 

Consider a linear combination 

aipi(x) + a 2 p2(x) + a s p 3 (x) + a^p^x) = 0 . (*) 

By taking coordinate vectors, we obtain 



1 


1 


0 


1 

ai 

2 

+ a 2 

3 

+ a 3 

0 

+ <24 

1 


1 


2 


2 


2 


The augmented matrix is 


’ 1 

1 

0 

1 

0 " 

2 

3 

0 

1 

0 

1 

2 

2 

2 

0 


Note that this is the matrix we considered in part (b) except for the last zero column. 

Thus by the same elementary row operations, we obtain the reduced row echelon form of this matrix 






























1 

0 

0 

2 

0 ' 

0 

1 

0 

-1 

0 

0 

_1 

0 

1 

1 

0 


Therefore we obtain the solutions 

a\ — —2a4 
«2 = &4 
«3 = —<H 


and a4 is a free variable. 

(This is the Gauss-Jordan elimination method.) 


Let us take <24 = 1 . Then from the above solutions, we obtain a\ — — 2,(12 — 1,^3 = — 1 . 

We plug these values in the linear combination (*) and obtain 

- 2 pi(x) + p 2 (x) -pz(x) + p 4 {x) = 0. 

Solving this foxp 4 (x), we obtain the linear combination 

p 4 (x) = 2 pi(x) -p 2 (x) +Pz{x). 

(Remark: the coefficients in the above linear combination of p 4 (x) is the fourth column of the reduced 
row echelon form.) 


Let P 2 be the vector space of all polynomials with real coefficients of degree 2 or less. Let 
S = {p! (x),p 2 {x),p 3 (x),p A (x)}, where 

Pi ( x ) = — 1 + x + 2x 2 , p 2 ( x) — x + 3x 2 
Ps ( x ) = 1 + 2x + 8 x 2 , P 4 (x) = 1 + x + x 2 . 

(a) Find a basis of P 2 among the vectors of S- 

(b) Let B' be the basis you obtained in part (a). For each vector of S which is not in B f , find the coordinate vector 
of it with respect to the basis B'. 

(The Ohio State University) 


Solution. 

(a) Find a basis of P2 among the vectors of S. 

The vector space P2 has a basis B — { 1 , x, x 2 } and the dimension of P2 is 3 . 
The coordinate vectors with respect to this basis are 






\Pi( x )]b = 

"-I" 

1 

» [P 2 (x)]b = 

"o' 

1 


2 


3 



'1" 


"r 

[P 3 (*)]b = 

2 

, [P 4 (®)]b = 

1 


8 


1 


Consider the vector space Span(T) spanned by 

T = {[pi(x)]b, \P2(x)]b, [p 3 (x)]b, [P 4 (®)]b>. 

We determine a basis vectors of Span(T) among vectors in T. 

Consider the matrix A whose column vectors are these coordinate vectors: 


A — 


-10 11 
112 1 
2 3 8 1 


Applying the elementary row operations, we obtain 


A = 


"-1 

0 

1 

r 

R2 ~\~Ri 

i?3 +2i?i 

'-1 

0 

1 

1" 

1 

1 

2 

1 

-> 

0 

1 

3 

2 

2 

3 

8 

1. 


. 0 

3 

10 

3. 


-Ri 




i?3 — 3i?2 


'1 

0 

-1 

-1" 

R\ +R% 

R 2 — 3 R 3 

"1 

0 

0 

-4" 

0 

1 

3 

2 

- > 

0 

1 

0 

11 

.0 

0 

1 

-3. 


.0 

0 

1 

-3. 


The first three columns contain the leading 1 ’s. 

By the leading 1 method, we know that 

{\pi( x )]b, [P2(*)]b, [P 3 (*)]b} 
is a basis of the vector space Span(T). 


It follows by the correspondence theorem that 

{pi{x),p 2 {x),p 3 (x)} 

is a basis of Span(S'). 

This yields that Span(S') is a three dimensional subspace of P 2 , which is also three dimensional. 
Hence we have P 2 = Span(S'). 

Therefore 


{Pi(*)>P 2 (*),P 3 (*)} 

is a basis of P2. 

(b) For each vector of S which is not in B' , find the coordinate vector of it with respect to the basis 

B'. 
























In part (a), we found a basis 

B' = {pi(x),p 2 (x),p 3 (x)} 

of the vector space P2 . 

So, we need to find the coordinate vector ofp^x) with respect to B f . 
Thus, we need to solve 


api(x) + bp 2 (x ) + cps(x) = pa(x) 


for a, 6, c. 


Using the coordinate vectors, this is equivalent to solve 


'-1 0 1' 


a 


"1' 

112 


b 

= 

1 

. 2 3 8. 


_c _ 


.1. 


The augmented matrix of this system is exactly the matrix A we dealt in part (a). 
By the reduction of A obtained in part (a) yields that 


a — —4,6 = 11, c = —3. 


Hence we obtain 


p 4 (x) = —4pi{x) + llp 2 (x) + 3 Pz(x), 
and thus the coordinate vector ofpi(x) with respect to B' is 


\P4(x)] b > 


-4 

11 

3 


Remark that this is the last column vector of the reduced row echelon form matrix of A. 


Let P2 be the vector space of all polynomials of degree two or less. Consider the subset in P2 

Q = {pi {x),P2(x),P3{x),P4(x)}, 

where 

Pi(x) = 1, p 2 (x) — X 2 + X + 1, 

p 3 ( x)—2x 2 , P 4 (x) — x 2 — X + 1. 

(a) Use the basis B — { 1 , x, x 2 } of P2, give the coordinate vectors of the vectors in Q. 

(b) Find a basis of the span Span(Q) consisting of vectors in Q. 

(c) For each vector in Q which is not a basis vector you obtained in (b), express the vector as a linear combination 
of basis vectors. 

(The Ohio State University) 










Solution. 

(a) Use the basis B = {1 , x, x 2 } of P2 , give the coordinate vectors of the vectors in Q 

To obtain the coordinate vectors with respect to the given basis B — { 1 , x, x 2 }, we fist express a given 
vector as a linear combination of basis vectors. ( You need to take care of the order of the basis vectors.) 

For example, for the vector P4,(x), we have the linear combination 

Pa(x) = 1*1 + (—1) • x + 1 • x 2 . 

Thus reading the coefficient of the basis vectors 1 , x, x 2 , we see that the coordinate vector of Pa(x) with 
respect to the basis B is 


[P 4 (*)]b = 

By the similar argument, the coordinate vectors are 

\pi(x)\ B = 


1 

-1 

1 


[P3(x)]b = 


\P 2 (x)\ B = 

[Pi(x)] B = 


1 
1 
1 

1 

-1 

1 


(b) Find a basis of the span Span(Q) consisting of vectors in Q 
Let 

T : = {\pi(x)] B , \p 2 (x)] B , \pz{x)] B , \pa{x)] b } 


[ 

1 


1 


0 


1 

\ 

) 

0 

? 

1 

? 

0 

? 

-1 


l 

0 


1 


1 

to 

1_ 


1 

> 


. 


Then a basis of Span(Q) consisting of vectors in Q corresponds to a basis of Span(T) consisting of 
vectors in T. 

Consider the matrix whose columns are the vectors in T. We reduce it by elementary row operations as 
follows. 


"1 

1 

0 

1 ' 

Ri —R 2 

'1 

0 

0 

2 ' 

1*3 

'1 

0 

0 

2 ' 

0 

1 

0 

-1 

- > 

0 

1 

0 

-1 

- > 

0 

1 

0 

-1 

0 

_1 

1 

2 

1 . 

R 3 —R 2 

.0 

0 

2 

2 . 


.0 

0 

1 

1 . 


The last matrix is in reduced row echelon form and the first three columns contain the leading 1 ’s. 
Therefore the set consisting of the first three vectors in T 


























f 

1 


1 


0 

> 

< 

0 

? 

1 

5 

0 

> 

< 

0 


1 


1 

to 

> 


is a basis for Span(T). 

(This is what I call the leading 1 method.) 

Therefore by the coordinate vector correspondence, it follows that the set 

{Pl(x),P2(x),p 3 (x)} 

is a basis for Span(Q) consisting of vectors in Q. 

(c) For each vector in Q which is not a basis vector you obtained in (b), express the vector as a Hr 
combination of basis vectors 

From the solution of part (b), we see that the vector p^(x) is the only vector in Q which is not a basis 
vector. 

Thus we want to express p^(x) as a linear combination ofpi(x)^p2(x)^ps(x). 

Consider a linear combination 

aipi(x) + a 2 p2(x ) + a s p 3 (x) + a^Piix) = 0. 

By taking coordinate vectors, we obtain 



l 


1 


0 


1 


0 

CLl 

0 

+ a 2 

1 

+ 0,3 

0 

+ (24 

-1 

= 

0 


0 


1 


2 


1 


0 


The augmented matrix is 


1 

1 

0 

1 

0 " 

0 

1 

0 

-1 

0 

0 

_1 

1 

2 

1 

0 


Note that this is the matrix we considered in part (b) except for the last zero column. 

Thus by the same elementary row operations, we obtain the reduced row echelon form of this matrix 


1 

0 

0 

2 

0 " 

0 

1 

0 

-1 

0 

1- 

0 

0 

1 

1 

0 


Therefore we obtain the solutions 

CLi — —2 a 4 

a 2 = a4 
G3 = -«4 

and a4 is a free variable. 

Setting <24 = 1 we obtain ai = — 2 , a 2 = 1 , <23 = —1 from the above solutions. 
We plug these values in the linear combination (*) and obtain 

























-2pi(x) +p 2 (x ) -pz(x) +pi(x) = 0. 
Solving this for £>4(3;), the vector p 4 (x) can be expressed as 

p 4 (x) = 2pi(®) -p 2 (x) +pz{x). 


^ Let P 2 be the vector space over M of all polynomials of degree 2 or less. 

Let S = {pi(x),p 2 (x),p 3 (x)}, where 

Pi(x) — x 2 + 1, p 2 (x) — 6x 2 + x + 2, ps(x) — 3x 2 +x. 

(a) Use the basis B — {a; 2 , x, 1} of P 2 to prove that the set S is a basis for P?. 

(b) Find the coordinate vector of p(x) = x 2 + 2x + 3 £ P 2 with respect to the basis S. 



Solution. 


The coordinate vectors of p\ ( x ) with respect to the basis B = {x 2 , at, 1} is given by 


[?i(*)]b = 


1 

0 

1 


as P(X) can be written as a linear combination pi(x) ~ 1 ■ x 2 


+ 0 • x + 1 • 1 of the basis vectors in B 


Similarly, we have 



"6' 


'3' 

\P 2 (x)]b = 

1 

_2_ 

and [pz(x)\ B = 

1 

.0. 


Let T = {[pi(x)]b, \P2(x)\ b , [P 3 (*)]b}- 
Then S' is a basis for P 2 if and only if T is a basis for R 3 . 
Thus it remains to show that T is a basis for R . 


Consider the matrix whose column vectors are vectors in T. 
We have 


"1 6 

3' 

0 ? 

1 

CO 

'1 6 

3 ' 

0 1 

1 

0 1 

1 

.1 2 

0 . 


1 

O 

-3. 


R\ — 6i?2 
Rs +4i?2 


'1 

0 

— 3" 

R\ +3i?3 

^2 — ^3 

'1 

0 

o' 

0 

1 

1 

-> 

0 

1 

0 

.0 

0 

1 . 


.0 

0 

1 . 




















It follows that T is linearly independent. 

As T consists of three linearly independent vectors in M 3 , it is a basis for M 3 . 

Hence S is a basis for P 2 . 

(b) Find the coordinate vector of p{x) = x 2 + 2 x + 3 E P2 with respect to the basis S. 

To find the coordinate vector \p(x)]g with respect to the basis S, we express p(x ) as a linear 
combination of the basis vectors in S. 

Thus we want to find scalars C \, c 2 , C3 such that 

p(®) = cipi(x) + c 2 p 2 (x) + csps(x). 


Considering the coordinate vectors of both sides, this is equivalent to 


1 


1 


6 


3 

2 

= Cl 

0 

+ C 2 

1 

+ C 3 

1 

3 


1 


2 


0 


We write this equation as a matrix equation 


'1 6 3 " 


Cl 


'1" 

Oil 


C 2 

= 

2 

.1 2 0 . 


_C 3 _ 


.3. 


To solve this, we apply elementary row operations to the augmented matrix as follows. 


" 1 

6 3 

1 " 

0? 

1 

CO 

ft} 

" 1 

6 3 

1 " 

O 

1 1 

2 

0 

1 1 

2 

1 

2 0 

3 


1 

0 

-4 -3 

2 


R\ — 6i?2 
Rs +4i?2 


1 

0 

-3 

-11 " 

Ri +3i?3 

" 1 

0 

0 

19 " 





R 2 —R 3 





0 

1 

1 

2 

- » 

0 

1 

0 

-8 

0 

0 

1 

10 


0 

0 

1 

10 


(Note that elementary row operations are exactly the same as before.) 

Hence the solution is C\ = 19 , C2 = —8, C3 = 10 . 

Thus, we have the linear combination 

p(x ) = 19 pi(x) — 8p2(x) + 10 pz(x) 

and the coordinate vector of p(x) with respect to the basis S is 


[p(*)]s 


19 

-8 

10 
































o 


Let 7^3 be the vector space of all polynomials of degree 3 or less. 
Let 


S = {pi{x),P 2 (x),p 3 (x),p 4 (x)}, 


where 


Pi ( x ) — 1 + 3 ^ + 2 x 2 — x 3 P2 (x) = x + x 3 
P3 (cc) = cc + cc 2 — cc 3 P4 (x) = 3 + 8a: + Sx 3 . 

(a) Find a basis Q of the span Span (S') consisting of polynomials in S- 

(b) For each polynomial in S that is not in Q , find the coordinate vector with respect to the basis Q. 
(The Ohio State University) 


Solution. 

(a) Find a basis Q of the span Span (S') consisting of polynomials in S. 
Let B = {1, x, x 2 , x 3 } be the standard basis for V 3 . 

With respect to the basis B , the coordinate vectors of the given polynomials are 


vi 



v 2 := [p 2 (®)]s = 


0 

1 

0 

1 



' 0 " 


■ 3 ' 

V3 := \pz{x)\b = 

1 

1 

, v 4 := \pa(x)] b = 

8 

0 


.- 1 . 


.8. 


Let T — {vi, v 2 , V 3 , V 4 } be the set of these coordinate vectors. 


We find a basis for Span(T) among the vectors in T by the leading 1 method. 

We form the matrix whose column vectors are the vectors in T and apply elementary row operations as 
follows. 


1 

0 

0 


3" 

i ?2 — 3i2i 


'1 

0 

0 

3 ' 







i ?3 —2R\ 







3 

1 

1 


8 

-R 4 -\-R\ 

s 

0 

1 

1 

-1 

i?4 — R 2 

2 

0 

1 


0 


? 

0 

0 

1 

-6 


.-1 

1 

-1 


8. 



.0 

1 

-1 

11 . 




"1 

0 


0 3 ' 




'1 

0 0 

3 ' 




0 

1 


1 -1 

R 4 +2R 2 

0 

1 0 

5 




0 

0 


1 -6 



- 7 

0 

0 1 

-6 




0 

0 


-2 12 




0 

0 0 

0 






















The first three columns of the reduced row echelon form matrix contain the leading 1 ’s. 

Thus, {Vi, V2, V3 } is a basis for Span(T). 

It follows that Q := {pi(x), p 2 (x), ps(x)} is a basis for Span (S'). 

(b) For each polynomial in S that is not in Q, find the coordinate vector with respect to the basis Q. 
Note thatp4(x) is not in the basis Q. 

The fourth column of the matrix in reduced row echelon form of part (a) gives the coefficients of the linear 
combination: 


Pi(x) = 3pi(x) + 5 p 2 (x) - 6p 3 (x). 
Thus, the coordinate vector of £>4(2:) with respect to the basis Q is 


[P4(*)]q = 


3 

5 


Comment. 


This is one of the midterm 2 exam problems for Linear Algebra (Math 2568 ) in Autumn 2017 . 


In part (b), some students stopped after obtaining the linear combination 
Pa(x) = 3pi(x) + 5 p 2 (x) - 6 ps(x). 

You must read the problem carefully. You are asked to find the coordinate vector of p^{x) with respect to 

Q- 


Let P 2 be the vector space of polynomials of degree 2 or less. 

(a) Prove that the set {1,1 + cc, (1 + x) 2 } is a basis for P 2 . 

(b) Write the polynomial f(x) = 2 + 3x-x 2 as a linear combination of the basis {1,1 + x, (1 + x) 2 }. 


Proof. 

(a) Prove that the set {1, 1 + X, (1 + x ) 2 } is a basis for P 2 . 

Consider the standard basis B = {1, x, x 2 } of P2. Using this basis, we can write the elements using 
coordinate vectors as 


[1]b = 

’1" 

0 

[1 + x]b = 

’1" 

1 

[(1 + *) 2 ]b = 

V 

2 


0 


0 


1 


We find the coordinate vector by writing an element as a linear combination of the basis elements. For 
example, (1+#) 2 = 1 + 2 x + lx 2 , and so the coefficients 1, 2,1 translate into the column vector 

" 1 " 













Because dim P 2 = 3, this set is a basis if and only if these three vectors are linearly independent. To 
verify this, consider the matrix 

"1 1 1" 

0 12. 

_0 0 1_ 

This matrix is upper-triangular, and the diagonal entries are all non-zero. This implies the matrix is non¬ 
singular, and so the columns are linearly independent. 

Thus, the set {1,1 + x, (1 + x) 2 } is a basis of P2. 

(b) Write the polynomial f(x ) = 2 + 3x- x 2 as a linear combination of the basis 

{M + *, (1 + z) 2 }- 

First Method: Using the technique of completing the square, we can factor the polynomial f(x) as we like. 
Specifically, 

f(x) — —x 2 + 3x + 2 

=— (x T - l) 2 T" hx T" 3 
——{x + l) 2 + 5(x + 1)—2. 

Hence, we have the linear combination 

f{x) = -2-l + 5(l + x)-(l + x) 2 . (*) 


Second Method: We can find this factorization by calculating the Taylor polynomial of f(x) centered at 
- 1 . 

This Taylor polynomial is defined by 

/w=/<-1)+/(-lx*+1)++1) 2 - 

The polynomial in (*) is recovered by finding /( — 1 ) — — 2 ,/ , ( — 1 ) — 5 , and/ ,, (— 1 ) = — 2 . 


^ Let V be the vector space of all 2 X 2 real matrices. Let S — {Ai, A2, A$, A4}, where 

3 7 

-2 6 

Then find a basis for the span Span (S'). 



Solution. 

Let B — {Eh ■ E u , E 2 i , E22 } t> e the standard basis of V, where 

0 0 
0 1 

We find the coordinate vectors of A \, A 2 , A3, A4 with respect to the basis B. 


En = 


1 0 
0 0 


,Eu = 


0 1 
0 0 


,E 2 i = 


0 0 
1 0 


) E22 = 






















For the matrix Ai, we can write it as a linear combination of basis vectors of B as follows. 
We have 


A\ = 1 • En + 2^12 + (—l)£?2i + 3^22- 


Hence the coordinate vector of Ai with respect to the basis B is 


[Ai] b 


1 

2 

-1 

3 


Similarly, we obtain 


' 0 ' 


‘ -1 ' 


‘ 3 ' 

-1 

> [Az\b — 

0 

5 [Ai]B — 

7 

1 

1 

-2 

. 4 . 


.-10. 


. 6 . 


Let 


T — {[Ai\b, [A 2 ]b, [M\b, [^. 4 ]^} 


/ 

1 


0 


-1 


3 

\ 


2 


-1 


0 


7 



-1 


1 


1 


-2 


< 

3 


4 


-10 


6 

> 


Then T is a subset in M 4 . 


We find a basis of Span(T) by the leading 1-method. 

We form a matrix whose columns are vectors in T and reduce it by elementary row operations as follows. 



' 1 

0 -1 

3 ' 


R 2 —2R\ 

Rs ~\~Ri 

"] 


0 

-1 

3 ' 


2 

-1 0 

7 


i?4 SRi 
-* 

0 

-1 

2 

1 


-1 

1 1 

-2 


0 

1 

0 

1 


. 3 

4 -10 

6 . 



.0 

4 

-7 

-3. 



"1 0 -1 

3 


i?3+i?2 


"1 

0 

-1 

3 ' 

i?2 ^^3 

0 10 

1 


i?4 — 4i?2 


0 

1 

0 

1 


- » 




- ► 







0-12 

1 




0 

0 

2 

2 



1 

O 


3 



0 

0 

-7 

-7 


'1 

0 

-1 

3' 

Ri +R 3 

'1 

0 

0 

4" 

0 

1 

0 

1 

R 4 —R 3 
-* 

0 

1 

0 

1 

0 

0 

1 

1 

0 

0 

1 

1 

.0 

0 

1 

1. 


.0 

0 

0 

0. 




































The last matrix is in reduced row echelon form and the first three columns contain the leading 1. 
Thus the first three vectors of T is a basis of Span(T). 

Hence 


{[Ai]b, [M]b, [^ 3 ]#} 

is a basis of Span(T). 

This yields that 


{Ai,A 2 ,A 3 } 


is a basis of Span(S') by the correspondence of coordinate vectors. 


Let C[— 1,1] be the vector space over R of all continuous functions defined on the interval [—1,1]. Let 
V := {f(x) £ C[- 1 , 1 ] | f(x) = ae x + be 2x + ce 3x , a,b,c£ M) 

be a subset in C[— 1,1]. 

(a) Prove that V is a subspace of C[— 1,1] . 

(b) Prove that the set B = { e x , e 2x , e^ x } is a basis of V . 

(c) Prove that B' = {e x — 2e 3x , e x + e 2x + 2e 3:E , 3e 2a: + e 3:r } is a basis for V. 


Proof. 

(a) Prove that V is a subspace of C[—1, 1] . 

Note that each function in the subset If is a linear combination of the functions e x , e 2x , e 3x . 
Namely, we have 

V = Spanje^, e 2cc , e 3x } 

and we know that the span is always a subspace. Hence If is a subspace of C[— 1,1]. 

(b) Prove that the set B = \e x , e 2x , e 3x } is a basis of If. 

We noted in part (a) that V = Span (B ). So it suffices to show that B is linearly independent. 
Consider the linear combination 

cie* + c 2 e 2x + c 3 e 3x = 9(x ), 

where is the zero function (the zero vector in Tf). 

Taking the derivative, we get 

c\e x + 2c 2 e 2x + 3cse Sx — 6{x). 

Taking the derivative again, we obtain 

c\e x + 4:C 2 e 2x + 9cse Sx — 6{x). 

Evaluating at x — 0, we obtain the system of linear equations 



Cl + C 2 + c 3 = 0 

ci + 2 c 2 + 3c 3 — 0 
ci + 4 c 2 + 9c 3 = 0. 


We reduce the augmented matrix for this system as follows: 


’ 1 

1 

1 

0 " 

R 2 —R\ 
- > 

' 1 

1 

1 

0 " 

1 

2 

3 

O ' 

0 

1 

2 

O ' 

1 

4 

9 

0 

Rs —Ri 

1 

0 

3 

8 

0 


1 

0 

-1 

0 " 

CO 

" 1 

0 

-1 

0 " 

Ri +R 3 

" 1 

0 

0 

0 " 

0 

1 

2 

0 

- > 

0 

1 

2 

0 

- > 

0 

1 

0 

0 

0 

0 

2 

0 


0 

0 

1 

0 

10 

l 

to 

0 

0 

1 

0 


It follows that the solution of the system is C\ — C2 — C3 =0. 
Hence the set B is linearly independent, and thus B is a basis for V. 


Anotehr approach. 


Alternatively, we can show that the coefficient matrix is nonsingular by using the Vandermonde 
determinant formula as follows. 

Observe that the coefficient matrix of the system is a Vandermonde matrix: 


A := 


1 

1 

1 2 


1 1 

2 3 

2 2 3 2 


The Vandermonde determinant formula yields that 


det (A) = (3 - 1)(3 - 2)(2 - 1) = 2 / 0. 

Hence the coefficient matrix A is nonsingular. 

Thus we obtain the solution C\ — C2 — C3 = 0. 

(c) Prove that B' = {e x — 2e 3x , e x + e 2x + 2e 3x , 3e 2x + e 3x } is a basis for V. 

We consider the coordinate vectors of vectors in B f with respect to the basis B. 

The coordinate vectors with respect to basis B are 


e x - 2e 3x ] B = 

' 1 ' 

0 

, [e x + e 2x + 2e 3x ] B = 

'1" 

1 

, [3e 2x + e 3x } B = 

'O' 

3 


-2 


2 


1 


Let Vi, V2, V3 be these vectors and let T = { Vi, V2, V3 }. 

Then we know that B f is a basis for V if and only if T is a basis for R 3 . 


We claim that T is linearly independent. 

Consider the matrix whose column vectors are Vi, V2, V3: 






























' 1 

1 

O' 

i?3 -\-2R\ 

"1 

1 

o' 

0 

1 

3 

-* 

0 

1 

3 

.-2 

2 

1 . 


.0 

4 

1 . 


'1 

0 

-3 ' 

R» 

'1 

0 

-3' 

R\ +3i?3 

'1 

0 

o' 

0 

1 

3 

- > 

0 

1 

3 

- > 

0 

1 

0 

.0 

0 

-11. 


.0 

0 

1 . 

R 2 — SRs 

.0 

0 

1. 


Thus, the matrix is nonsingular and hence the column vectors Vi, V2, V3 are linearly independent. 

As T consists of three linearly independent vectors in the three-dimensional vector space M 3 , we conclude 
that T is a basis for IR 3 . 

Therefore, by the correspondence of the coordinates, we see that B' is a basis for V. 

Related Question. 

If you know the Wronskian, then you may use the Wronskian to prove that the exponential functions 
e®, e 2x , e 3x are linearly independent. 

See the post 

Using the Wronskian for Exponential Functions, Determine Whether the Set is Linearly Independent for 
the details. 


Try the next more general question. 


Problem. 

Let Ci, C2,. . ., c n be mutually distinct real numbers. 

Show that exponential functions 

C\X C 2 X CnX 


are linearly independent over M. 


The solution is given in the post 1 
Exponential Functions are Linearly Independent 


0 


Let P ra (M) be the vector space over M consisting of all degree n or less real coefficient polynomials. Let 


U = {p(x) e P„(R) I p( 1) = 0} 


be a subspace of P ra (M). Find a basis for U and determine the dimension of U . 


Solution. 

















Let p(x) — a n x n + a n -\X n 1 + • • • + a±x + ao be an element in U. 

Since p(l) — 0, we have a n + a n -i + • • • + a\ + ao = 0. Thus we have 

p(x) = a n x n + a n -ix n ~ l H-b a\x - (a n + a n _i H-h ai) 

= a n (x n - 1) + a n _i(x n_1 — 1) 4- a\(x — 1). 


Let us define 


<?n(z) = x n - l,q n -i(x) = x n 1 - = x - 1 G U. 

Then the above computation shows that we have 

p(x) = a n q n (x) + a n -iq n -i(x) H- aiqi(x). 

In other words, any element p(x) £ U is a linear combination of q\ (x ),..., q n (x). 
Hence B {qi ( x ),..., q n (a^)} is a spanning set for U. 


We show that B is a linearly independent set. 
Consider a linear combination 


c\q\{x) H-b c n q n (x ) = 9{x) 0x n + 0x n 1 + • • • + Ox + 0. 


Then we have 


c n x n + c n -ix n 1 + • • • + CiX — (ci + C 2 H-+ c„) = 0. 


Comparing the coefficients of both sides, we obtain 

Cl = C 2 = * * * = c n = 0 

and thus q\ (x),..., q n ( x ) are linearly independent, and hence B — { q\ (x),..., q n (#)} is a basis for 
the subspace U . Since the dimension is the number of vectors in a basis, the dimension of U is n. 


Let V be a vector space over R and let B be a basis of V. Let S — {v\ , U 2 , U 3 } be a set of vectors in V . If the 
coordinate vectors of these vectors with respect to the basis B is given as follows, then find the dimension of V 
and the dimension of the span of S. 


"1" 


O 


"1" 

n 

, Mb = 

1 

5 Mb = 

1 

O C 

0 

0 

. 0 . 


. 0 . 


-1 

0 
_1 


Solution. 

Coordinate vectors 









Let us first recall the definition of coordinate vectors. 

Suppose that V is a vector space over R and let B — {v \, v 2 ,.. •, v n } be a basis of V (hence the 
dimension of V is n). 

Then any element of v of V can be written as 

V = C1V1 + c 2 v 2 H-h c n v n , 

where C \, C2,..., c n are scalars in R. 

This expression is unique and the coordinate vector of v with respect to the basis B is defined as 

ci 
C2 

Cn _ 

Thus the coordinate vector [v] b is just a usual n -dimensional vector. 

Main part of the solution 

Note that in the current problem, the coordinate vectors are 4 -dimensional vectors. 

This implies that the basis B consists of four vectors. Hence the dimension of V is 4 . 

By the correspondence of the coordinate vectors, the dimension of Span (S') is the same as the 
dimension of Span(T), where 



T — {[vi]b, [V2]b, [V2 ]b} 


1 


0 


1 

0 


1 


1 

0 

5 

0 

? 

0 

0 


0 


0 


To find the dimension of Span(T) , we need to find a basis of Span(T). 

One way to do this is to note that the third vector is the sum of the first two vectors. Also, it’s clear that the 
first two vectors are linearly independent. 


Thus, the set 



is a basis of Span(T), hence the dimension of Span(T) is 2. 

We conclude that the dimension of Span(iS) is 2 as well. 

(We can also conclude that the set {'Ci, v 2 } is a basis of Span (S').) 

Another way to find a basis of Span (T) 














Here is another way to find a basis of Span(T) . We can use the leading 1 method. 

We form the matrix whose column vectors are the vectors in T : 

"1 0 1 " 

0 1 1 

0 0 0 

_0 0 0 _ 

Note that this matrix is already in reduced row echelon form. 

The first two columns contain the leading l’s. Thus the first two columns form a basis of Span(T) (this is 
the leading 1 method). 

Remark: In general we apply elementary row operations to reduce the matrix into a matrix in reduced row 
echelon form. 


Let V be a vector space and B be a basis for V. 

Let wi, w 2 , w 3 , W 4 , w 5 be vectors in V. 

Suppose that A is the matrix whose columns are the coordinate vectors of wi, W 2 , W 3 , W 4 , W 5 with respect to 
the basis B. After applying the elementary row operations to A, we obtain the following matrix in reduced row 


echelon form 


10 2 
0 1 3 

0 0 0 
0 0 0 


1 0 " 
0 1 
0 0 
0 0 


(a) What is the dimension of V? 

(b) What is the dimension of Spanjwi, w 2 , w 3 , W 4 , w 5 }? 
(The Ohio State University) 


Solution. 

(a) What is the dimension of Vl 

Suppose the dimension of V is n. 

This means that the basis B consists of n vectors. 

Then the coordinate of Wi with respect to B is an n-dimensional vector [ Wi ] B €= K - n • 

Thus, the number of rows in the matrix A is n. 

As the elementary row operations do not change the number of rows, we see from the given matrix that the 
A has four rows. 

Thus, the dimension of If is 4 . 

(b) What is the dimension of Span{ Wi, W2, W3, W4, W5 }? 

Note that the dimension of W Span{wi, w 2 , W3, W4, W5 } is the same as the dimension of 

W' := Span{[wi] 5 , [w 2 ] B , [w 3 ] s , [w 4 \ B , [w 5 ] s }. 

Since the column vectors of A are these coordinate vectors [w z ]^ and its reduced row echelon contains 
the leading l’s in the first two columns, we conclude that {[wi]#, [w 2 ] j g} is a basis for W' by the 







leading 1 method. 

It follows that {wi, W2 } is a basis for W, and its dimension is 2 . 


^ Let V be the set of all n X n diagonal matrices whose traces are zero. That is, 


/ 

an 

0 . 

.. 0 




0 

a 2 2 • 

.. 0 


CZ-il ? ? O'nn E C, 

< A = 

0 

0 



tr(A) = 0 


. 0 

0 . 

• • ®nn_ 




Let Eij denote the n X n matrix whose (i, j) -entry is 1 and zero elsewhere. 

(a) Show that V is a subspace of the vector space M n over C of all n X n matrices. (You may assume without a 
proof that M n is a vector space.) 

(b) Show that matrices 


L 1 77J 771 771 771 771 

L/ll — -c /22 5 *- j 22 ~ ^33 5 • • • j Vln -1 — L nn 

are a basis for the vector space V. 

(c) Find the dimension of V. 


Hint (Subspace Criteria) 


For (a), use the criteria for a subset to be a subspace. 

^ The zero vector 0 E M n is in V. 

^ For u, v E V, the sum u + v E V. 

0 For v E V, c E C, the scalar product cv E V. 


Proof. 

(a) Visa subspace of the vector space M n over C 
We check the following criteria for a subset to be a subspace. 
^ The zero vector 0 E M n is in V. 


o For u, v G V, the sum u + v G V. 


£ For v E V, c E C, the scalar product cv E V. 











For ( 1 ), note that the zero vector 0 £ M n is the n X n zero matrix. Thus it is clearly in V. 


To check ( 2 ), take u, v £ V. Then u, v are diagonal matrices and tr(u) = 0, tr(v) = 0. 

Thus the sum u + v is a diagonal matrix and the trace is 

tr(u + v) = tr(u) + tr(v) = 0 + 0 = 0. 

Hence the sum u + v £ V . 

Finally, to prove ( 3 ) we take v £ V, c £ C. 

Then v is a diagonal matrix and tr(v) = 0. 

Thus cv is also a diagonal matrix and the trace is 

tr(cv) = ctr(v) = c • 0 = 0. 

Hence cv £ V. 

Therefore the subspace criteria (l)-( 3 ) hold. Thus V is a subspace of the vector space M n . 

(b) A basis for the vector space V. 

Note that for each 1 < i < n — 1 , the matrix Ea — E{+\ is a diagonal matrix and the trace 

tr (Ea ~ E i+li+ 1 ) = tr(Eu) - tr (E i+li+1 ) = 1-1 = 0. 

Hence the matrix Ea — Ei+\ \ £ V. 

We show that the matrices Ea — E{+i £+1 span V. 

Let A = ( a,ij ) £ V. We want to solve the following linear combination. 

A — Ci (En ~ E22) + C -2 (E22 ~ ^ 33 ) + ' * * + Cn-l(En-ln-l ~ E. n n )• 

Rearranging this, we obtain 

A — CiEn + (02 — Ci)E22 + • • • + (c n - 1 — C n -2)E n -i n -i — C n -iE nn . 

Namely, in the matrix form we have 



0 . 

«22 



ci 0 

. O' 

an 

0 

0 0 


0 c 2 — ci 

. 0 

. 0 

0 . 



0 

0 C n —1 2 0 

• • C^nn _ 


. 0 

... 0 C n ~1 _ 


Comparing entries, we have the system of n equations in n — 1 unknowns Ci,..., c n _i 






a 11 = ci 

&22 = C2 — Ci 
«33 = C3 — C2 


C-n—In—1 — C n _i C n _2 
^nn C ni • 


We solve this system as follows. 
The matrix form of this system is 


1 

-1 1 
-1 1 



Cl 


a n 

C2 

= 

O 

• • to 

to 

_ Cn—1_ 


_ d nn _ 


(The empty entries are all zero.) 


The augmented matrix is 


1 

an 

-1 1 

&22 

-1 1 

«33 

-1 1 

O’n— In—1 

-1 



Then we reduce this matrix as follows. 

Add the first row to the second row, then add the second row to the third row. 
Repeating this process we get 


" 1 

an 

1 

dll + d22 

1 

a ll + a 22 + a 33 

1 

&11 + d22 + * * * + a n -i n -i 

0 

dll + d22 + * * * + d nn 


Note that since A E V, tr(A) = an + <222 + • • • d nn — 0 , thus the right-bottom entry ((n, n) -entry) 
of the augmented matrix is 0, hence the system is consistent and it has the unique solution 














ci = an 

C2 = an + 0-22 

C3 — 0 \\ + 022 + O33 


Cn-l — a ll + a 22 + • • • + a„_i n _i. 


n 


Therefore the matrices E\\ — E22 , E22 ~ E33, • • •, E n -\ n -\ — E nn span V. 


To show that these matrices are linearly independent, we can reuse the above computation. 

Suppose we have a linear combination 

0 = Cl (Eli — E22) + C2(E22 — E 33 ) + • • • + c n _i(E n _i n _i — E nn ). 

Here 0 is the n X n zero matrix. 

Since an = 0 in the above computation, we see from (*) that c\ — C2 — * * * = c n _i = 0 , hence the 
matrices En — Ei+u+i are linearly independent. 

Therefore those matrices are a basis of V. 

(c) The dimension of V. 

From part (b), the n — 1 matrices 

T7> 771 771 HP HP HP 

till ~ ti22, ti22 ~ ^ 33 , . . . , Ai-ln -1 — ti nn 

are a basis of V . Thus the dimension of V is n — 1. 


Let C[— 2 tt, 27 r] be the vector space of all real-valued continuous functions defined on the interval [— 27 T, 27 t] . 
Consider the subspace W = Spanjsin 2 (cc), cos 2 (x)} spanned by functions sin 2 ( x ) and cos 2 ( x ). 

(a) Prove that the set B = {sin 2 (cc), cos 2 (x)} is a basis for W. 

(b) Prove that the set {sin 2 ( x ) — cos 2 (cc), 1 } is a basis for W. 


Solution. 

(a) Prove that the set B = {sin 2 (x ), cos 2 (x) } is a basis for W. 

By definition of the subspace W — Span{sin 2 (x) , cos 2 ( x )}, the we know that B is a spanning set for 

w. 


Thus, it remains to show that B is linearly independent set. 
Suppose that 


ci sin 2 (x) + C2 cos 2 (x) = 0. 
This equality is true for all x G [— 27 T, 27 t], 



In particular, evaluating at x = 0 , we see that C2 = 0 . 

Also, plugging in x — 7t/2 yields C\ — 0 . 

Therefore, sin 2 (x) and cos 2 (x) are linearly independent, that is, B is linearly independent. 

As B is a linearly independent spanning set, it is a basis for W. 

Prove that the set {sin 2 (x) — cos 2 (x), 1} is a basis for W. 

Note that sin 2 (x) — cos 2 ( x ) and 1 are both in W since both functions are linear combination of 
sin 2 (x) and cos 2 (x). Here, we used the trigonometric identity 1 = sin 2 (x) + cos^ x). 

By part (a), we see that dim(V^) = 2 . So if we show that the functions sin 2 (x) — cos 2 (x) and 1 are 
linearly independent, then they form a basis for W. 


We consider the coordinate vectors of these functions with respect to the basis B. 
We have 


[sin 2 (x) — cos 2 (x)]b — 

[1\ B = [sin 2 (x) + cos 2 (x)\ b = 


and 

1 
1 


Since we have 


■ 1 

r 

R2 ~\~Ri 

'1 

r 

\R, 

'1 

r 

R\ R\ 

'1 

o' 

-1 

1. 

- > 

.0 

2. 

- > 

.0 

1. 

- > 

0 

1. 


the coordinate vectors are linearly independent, and hence sin 2 (x) — cos 2 (x) and 1 are linearly 
independent. 

We conclude that {sin 2 (x) — cos 2 (x), 1} is a basis for W. 

(Another reasoning is that since the coordinate vectors form a basis for M 2 , {sin 2 (x) — cos 2 (x), 1} is 
a basis for W.) 

Comment 

You may directly show that {sin 2 (x) — cos 2 (x), 1} is linearly independent just like we did for part (a). 


O Suppose that B — {vi, V 2 } is a basis for M 2 . Let S [vi, V 2 ]. Note that as the column vectors of S are 
linearly independent, the matrix S is invertible. Prove that for each vector vGf, the vector S' -1 v is the 
coordinate vector of v with respect to the basis B. 


Proof. 

We first express the vector v as a linear combination of the basis vectors 


V = CiV + C2V 2 . 


















This expression is unique and the coordinate vector of v with respect to the basis B is defined to be 


v]b = 


Cl 

C2 


Let 


S 


-l 


v = 


Xi 

X 2 


Or equivalently, 


v = S 



X 2 


Our goal is to show that 


~ Xi 


Ci 



_C2_ 


We have 


ClV + C2V 2 = V = S 


Xi 

X 2 


= XlVl + X2V 2 . 


Hence 


(*1 - Ci)Vi + (x 2 - C 2 )v 2 = 0. 


As B — {v 1 , V 2 } is linearly independent, we obtain x\ — C\ and X2 — C 2 - 


s~\ = 


ci 

C2 


= v \B- 


Therefore, 
















Dimensions of General Vector Spaces 


Definition 


Q The dimension dim(Tf ) of a vector space V is the number of vectors in a basis for V. 


Summary 


Let V be a vector space over a scalar field K. Suppose that \dim(V)=n. Le£S=\{\mathbf{w}_l, \dots, 
\mathbf{w} _k\}beasetof vector sin\%. 

The dimension of V does not depend on the choice of a basis. 

Q If W is a subspace of V , then dim(f^) < dim(Tf). 

Q If k > n, then the set S is linearly dependent. 

If k < n, then the set S cannot span the vector space V. 


Problems 


Let V be the vector space of all 2 X 2 matrices whose entries are real numbers. Let 


W = 


A e V 




for any a, b, c £ 


}■ 


(a) Show that W is a subspace of V. 

(b) Find a basis of W. 

(c) Find the dimension of W. 

(The Ohio State University) 


Solution. 

(a) Show that IF is a subspace of V. 


To show that W is a subspace of V, we verify the following subspace criteria. 
The zero vector in V is in W. 

For all A,B E W, the sum A + B E W. 

Q For all A E W and r E R, the scalar multiplication rA E W. 




















Note that the zero vector in V is the zero matrix 


“0 

0 


o- 

0 


The zero matrix corresponds to the matrix in W with a = b = c = 0 , and hence the zero vector of V is 
in W. 

Condition 1 is met. 


To verify condition 2 , let 


A = 


b 

— a 


and A = 


b 

-a 


be arbitrary elements in W, where a,b,c,d, a ,b ,c ,d E R. 
Then we have 


A+A = 


" a b 


a b 


a + a b + b 


+ 


= 


c ~a 


c ~a 


c + c —(a + a) 


and this is of the form of elements of W. Hence A+A E W and condition 2 is met. 


check condition 3 . 

be an arbitrary element in W and let r E R be any real number. 

Then we have 


Finally, let us 
"a b 


Let 


-a 



“a b 


' ra rb 

rA = r • 

c ~a 

— 

rc ~(ra) 


and this is of the form of the elements of W. Thus rAEW and condition 3 is met. 
Therefore, by the subspace criteria, we conclude that W is a subspace of V. 

(b) Find a basis of W. 


Any vector A = 


a 

c 



in the subspace W can be written as 


"a b 


■1 

0 ‘ 


■0 

1 ■ 


■0 

0" 

c ~a 

= a 

0 

-1 

+ b 

0 

0 

+ c 

1 

0 


It follows that the matrices 


■1 0 ■ 


■0 1 ■ 


■0 

0" 

1 

o 

? 

0 0 

? 

1 

0 




































span the subspace IF. 


If the above linear combination is the zero matrix, then a = b = c = 0 . 
This implies that these matrices are linearly independent. 

Thus, 


1 0 1 
0 -1 


0 1 
0 0 


0 0 
1 0 


is a basis for W. 

(c) Find the dimension of W. 

As the basis we obtained contains three vectors, the dimension of IF is 3 . 


Let V be the vector space of all 2 X 2 matrices. Let W be a subset of V consisting of all 2 X 2 skew-symmetric 
matrices. (Recall that a matrix A is skew-symmetric if A T = — A.) 

(a) Prove that the subset W is a subspace of V. 

(b) Find the dimension of W. 

(The Ohio State University) 

Proof. 

(a) Prove that the subset IF is a subspace of V 


To prove that IF is a subspace of F, we check the following subspace criteria. 


(i) The zero vector 0 E V is in IF. 

(ii) For any vectors u, v e IF, the sum u + v is in IF. 

(iii) For any vector u E W and any scalar c E R, the scalar product cu E W. 


The zero vector in V is the 2 x 2 zero vector 


0 = 


"0 

0 


0" 

0 


Since have 


0 T =0 = -O, 

the zero vector O is a skew-symmetric matrix. Thus O is in IF, and hence condition (i) is met. 


For condition (ii), consider A, B E W. This means that A, B are skew symmetric matrices and thus we 
have 










a t = -a,b t = -b. 

To show that A + 5 6 W, we compute as follows. 

(*) 

(A + B) t = A t + B t = -A + (~B)= - (A +B). 


Thus we have (A + B) T = — (A • 5 ), and it follows that the matrix .4 + 5 is skew-symmetric. 
Hence A+ B E W and condition (ii) is also met. 


To check condition (iii), consider A E W and c E R. 

We want to show that cA E W, that is, we want to show that cA is a skew-symmetric matrix. 


We see this by computing as follows. 

(cA) J = cA t 

= c( — A) since A is skew-symmetric 
= - (cA). 

Thus we obtained ( cA) T = - ( cA ), and thus cA is a skew-symmetric matrix as required. 
Hence cA E W and condition (iii) is satisfied. 


We confirmed all three subspace criteria (i)-(iii), and thus conclude that W is a subspace of V. 


(b) Find the dimension of W 


Let A = 


a b " 
c d 


be a 2 x 2 matrix. 


If A is a skew-symmetric matrix, namely we have A T = - A, we have 


' a 

c ' 


'a b " 

b 

d 


c d 


Comparing entries of the matrices, we obtain 

a = - a 
b = - c 
d d. 


It follows that a = 0 , d = 0 , and c = - b. 

Thus any skew-symmetric matrix is of the form 



■ 0 b ■ 


■ 0 1 ■ 

A = 

-b 0 

= b 

-1 0 


Therefore, the subspace W is spanned by 















and since the set B consists of only one vector, it is linearly independent, and thus B is a basis. 
From this, we conclude that the dimension of W is 1 . 


Let P 4 be the vector space consisting of all polynomials of degree 4 or less with real number coefficients. Let W 
be the subspace of P 2 by 

W = {p(x) e P± | p(l) + p(—1) = 0 and p(2) +p(—2) = 0}. 

Find a basis of the subspace W and determine the dimension of W. 

Proof. 

Let p(x) 6 W and write 

p(x) = + ape + a 2 x 2 + a 

for some coefficients a 0 , a 1? a 2 , 0 3 , a 4 6 R. 

We first compute the values of p(x) at x = ±1 and x= ± 2 . 

We have 


p{ 1) = <Zq + aj + a 2 + + a 4 

/?(1) = _ ^3 + a 4 

p( 2) = <Zq + 2a j + 4 a 2 + + 16 a 4 

/?( - 2) = <Zq - 2a i + 4^2 _ 8a 3 + 16 « 4 . 


Since p(x) is in W, it satisfies 

/?(1) +p(- 1) = 0 and /?(2) + p{ - 2) = 0, 

and thus we have 


p( 1) +p( - 1) = 2 czq + 2^2 + 2^ 4 = 0 
p(2) +p( — 1) = 2 <2 q + 8^2 + 32 a 4 = 0. 


Dividing them by 2 , we have the system of linear equations in a 0 , a 2 , a 4 . 

a 0 + a 2 + a 4 = 0 
+ 4a 2 + 16^4 = 0 . 


We reduce the augmented matrix of the system by elementary row operations as follows. 


1 


■ 1 

1 

1 

0" 

r 2 

-*1 

■1 

1 

1 

0" 

3*2 

1 

4 

16 

0 



0 

3 

15 

0 



'I 

1 

1 

0- 


r 2 

■ 1 

0 

-4 

0" 


0 

1 

5 

0 



0 

1 

5 

0 * 


Hence the solutions to the system is 










and a 4 is a free variable. 


Substituting these relations into p(x), we obtain 


p(x) = 4 a 4 + ape - 5 ape 2 + ape^ + ape^ 

3 74 

= ape + ape + a 4 {4 - 5 x + x ). 


Let 


p x {x)=x 
p 2 (x) = x 3 


£> 3 (x) = 4 - 5 x z + 


■2± v 4 


These are vectors in W. 

(To see this, you may directly check the defining relations, or set a l = a 3 = 0 and a 4 = 1 to get p^(pe). 
Similarly for^^x) and p 2 (x).) 

Then by the above computations, any vector p(x) in W is a linear combination 

p(x) = a x pi(x) + a 3 p 2 (x) + a4p 3 (x). 

Thus, {pi(x),p 2 (x),p 3 (x)} is a spanning set of W. 


Also, the vectors pi(x),p 2 (x),p 2 (x) are linearly independent. 

In fact, if we have c^pi(x) + c^^x) + cp) 2 (pe) = 0 , then we have 

o = C x p x (x) + C 2 p 2 {x) + CtP 3 (x) 

= C jX + C 2 x 3 + C 3 (4 - 5x 2 + x 4 ) 

= 4c 3 + c pc — 5 c 3 x 2 + c 2 x 3 + CjX 4 . 

Since the coefficients must be zero, we have c 1 = c 2 = c 3 = 0 . 

This proves that p x {x),p 2 {pe),p 2 {pe) are linearly independent. 


Therefore the set {p x {x),p 2 {pe),p 2 {x)} is linearly independent spanning set of the subspace W, hence it 
is a basis for W. 

Thus the dimension of the subspace If is 3 . 


^ Let pi (x ), P2 (x ), P3 (x ), P4 [x) be (real) polynomials of degree at most 3 . Which (if any) of the following two 
conditions is sufficient for the conclusion that these polynomials are linearly dependent? 

(a) At 1 each of the polynomials has the value 0 . Namely pi ( 1 ) = 0 for i — 1 , 2 , 3 , 4 . 


(b) At 0 each of the polynomials has the value 1. Namely pi (0) — 1 for i — 1, 2, 3, 4. 
(University of California, Berkeley) 



© 

O 


The set P 3 consisting of all polynomials of degree 3 or less is a vector space of degree 4. 


Consider a subset of P 3 satisfying the condition (a) and prove that its dimension is less than or equal to 


3. 


Proof. 

(a) At 1 each of the polynomials has the value 0. 

We show that the condition (a) is sufficient. Let us denote by P 3 the vector space consisting of all polynomials (with the 
variable x) of degree 3 or less. Then the dimension of P 3 is 4. 

Consider 


W := {p(x) e P 3 | p( 1) = 0}. 

Then this is a subspace of P 3 . (Check it or see another solution below.) 

Since, for example, the polynomial q(x) — x E P 3 is not in W, the subspace W is a proper subspace of P 3 . Hence 
dim(fF) < dim(P 3 ) — 4. 

(Actually, the dimension is 3, see another solution below.) 

Since the dimension of W is less than or equal to 3, any four vectors in W must be linearly dependent. Thus pi are 
linearly dependent. 

Another Solution of (a) 

This solution is simpler than the previous solution. 

Consider a map P 3 —>• M sending p{x) to p(l) (evaluating at x — 1). 

Then this is a linear transformation. The kernel of this linear transformation is 

W := {p(x) e p 3 | p( 1) = 0 }. 

(As it is the kernel, W is a subspace of P 3 .) 

By the rank-nullity theorem , we see that dim(fF) = 3. Hence any four vectors in W are linearly dependent. 

(b) At 0 each of the polynomials has the value 1. 

We show that the condition (b) is not sufficient. 

In fact, the following four vectors satisfy the condition (b) but they are linearly independent: 

1,1 + 1 + X + x 2 ,1 + x + x 2 + x 3 . 

(Observe that the method of the proof of (a) does not work because 

V = {p(x) e P 3 | p( 0 ) = 1} 


is not a subspace.) 



Comment. 

The second proof of (a) is simpler and I think better than the first proof. 

The reason I wrote the first proof is that it is more accessible for the first year linear algebra students. 


• Let V be the vector space of all 3 X 3 real matrices. Let A be the matrix given below and we define 

W = {M e V | AM = MA}. 

That is, W consists of matrices that commute with A. Then W is a subspace of V . Determine which matrices are in the 
subspace W and find the dimension of W. 

(a) 


A = 


a 0 
0 b 
0 0 


0 

0 

c 


where a, 6, c are distinct real numbers, 
(b) 


a 


A = 


0 

0 


0 0 
a 0 
0 b 


where a, b are distinct real numbers. 

Solution. 


(a) Diagonal matrix with distinct diagonal entries 


Let us first determine when a matrix M commutes with A. 
Let 


a u 

a 12 

a n 

a 2\ 

a 22 

a 23 

a 31 

a 2>2 

a 33 


and suppose that AM = MA: 


'a 

0 

O' 


'a ii 

a \2 

■ 

co 

sT 


' a 11 

a \2 

■ 

CO 


' a 

0 

o- 

0 

b 

0 


a 2l 

a 22 

a 23 

= 

a 2\ 

a 22 

a 23 


0 

b 

0 

0 

0 

c 


«31 

a 32 

a 33 


a 3\ 

a 32 

a 33 


0 

0 

c 


Computing matrix products, we obtain 


B aa n 

aa u 

aa i3 ' 


u a n a 

<2 12 b 

a l3 c " 

ba 2x 

be 2 22 

ba 23 

= 

a 2l a 

<2 22 b 

a 23 c 

Ci 

Si 

OJ 

ca 32 

ca 33 


a 3l a 

a 32 b 

a 33 c 


Compare the ( 1 , 2 ) entries and we have aa u = ba l2 . 
Since a±b, we must have <z 12 = 0 . 




















Similarly, comparing the off-diagonal entries and noting a , b , c are distinct, we find that off diagonal entries 
a-, i ^ j must be 0. 

Thus, M commutes with A if and only if 


M = 


0 

0 


0 0 " 

a 22 0 
0 a 33 


Therefore, the subspace W consists of all 3 x 3 diagonal matrices: 

W= {WE V | W is diagonal}. 

Then it is easy to see that the set {E n , E 22 , £33} is a basis for W, where Ey is the 3 x 3 matrix whose (z,y)-entry is 
1 and the other entries are zero. Thus the dimension of W is 3 . 

(b) Diagonal matrix two diagonal entries are the same 

Now consider the case 


' a 


A = 


0 

0 


0 O' 
a 0 
0 b 


Let 


a 11 

a 12 

a 13 

a 2 \ 

a 22 

a 23 

a 3 \ 

a 32 

a 33 


and compute AM = MA as in part (a) (or you just need to replace b , c in (*) by a , b , respectively) and obtain 


u aa n 

aa l2 

aa \3 ' 


u a n a 

a l2 a 

a \ 3 b " 

aa 2 \ 

aa 22 

aa 2 3 

= 

a 2 \ a 

a 22 a 

a 23 b 

ba 3 l 

b a 32 

ba 33 


a 3 \ a 

a 22 a 

a 33 b 


Comparing entries and noting a ^ b, we have 

a \3 = 0 ’ a 23 = 0 ’ a 3 l = 0 ’ a 32 = 0 * 
Thus, M commutes with A is and only if 



■«n 

a 12 

0 " 

M = 

a 21 

a 22 

0 


0 

0 

a 33 


and hence the subspace W consists of such matrices. 
From this, we see that the set 














f^ll’ E 12 , ^21’ ^22’ ^33} 

is a basis for W, and we conclude that the dimension of W is 5 . 

• Let U and V be finite dimensional subspaces in a vector space over a scalar field K. Then prove that 
dim(£/ + V) < dim (U) + dim(V). 

Definition (The sum of subspaces). 

Recall that the sum of subspaces U and V is 

U+ V= {x + y | x G U,y E V}. 

The sum U+ Vis a subspace. 

(See the post “The sum of subspaces is a subspace of a vector space” for a proof.) 

Proof. 

Let n = dim(£/) and m = dim(F)- 
Let 


B l = {«i> ■■■>«,,} 

be a basis of the vector space U and let 

B 2 = {Vj, 


be a basis of the vector space V. 

An arbitrary element of the vector space U + W is of the form x + y, where x G U and y G V. 


Since B 1 is a basis of U, we can write 


x = r 


l u i + - + r n u n 


for some scalars r 1? ...,r n E K. 

Also, since B 2 is a basis of V, we can write 

y = s l y l + + s m y m 


for some scalars s l9 ..., s m E K. 


Thus, we have 


x + y = r x \\ x + ••• + r n u n + + - + s m x n 


and hence x + y is in the span S:= Span(u 1? ..., u n , v 1? ..., v m ). 

Thus we have U + W S and it yields that 

dim(£/ + W) < dim(*S) <n + m = dim (U) + dim(L)- 


This completes the proof. 


Related Question. 


Let A and B be m x n matrices. 
Prove that 


rank(^ + B) < rank(v 4 ) + rank (B). 


• Let A and B be m x n matrices. Prove that rank(A + B) < rank(A) + rank(L?). 

Proof. 

Let 


A = [ai,...,a„] and B — [bi,...,b„], 

where a i and are column vectors of A and B , respectively. 

Then the rank of the matrix A is the dimension of the column space of A. 

That is, we have 


rank(^4) = dim( Span(ai,... ,a n )). 


Similarly, we have 

rank(i?) = dim( Span(bi,..., b n )) 


and 


rank(^4 + B) = dim( (Span(ai + bi,..., a n + b n )) 
since A + B = [ai + bi,..., a n + b n ]. 


We claim that 


Span(ai +bi,...,a„ + b„) C Span(ai,... ,a„) + Span(bi,... ,b„). 
Any vector x 6 Span(ai + bi,..., a n + b r( ) can be written as 

x = ri(ai + bi) 4-b r n ( a n + b n ) 


for some scalars r \,..., r n . 


Thus we have 


x = n(ai + bi) H-hr„(a„ + b n ) 

= (nai H- b r n a„) + (nbi H -b r„b n ) 

G Span(ai,...,a n ) + Span(bi,... ,b n ), 


and hence the claim is proved. 


Then we have 


rank(^4 + B ) 

= dim( (Span(a! + b l5 ... ,a n + b n )) 

< dim(Span(a 1 ,...,a„) + Span^,... ,b„)) 

< dim( Span(a 1 ,..., a n )) + dim( Span(b 1 ,..., b n )) 
= rank(vl) + rank(£?). 


Here we used the fact that for two subspaces U and V of a vector space, we have 

dim(C7 + V) < dim(C7) + dim(F). 

For a proof of this fact, see the post “Dimension of the sum of two subspaces“. 


^ Linear Transformation from R n to R m 


Definition 


Q A function T : M n -A M m is called a linear transformation if T satisfies the following two 
linearity conditions: For any x, y G M n and c G M, we have 

1. T(x + y)=T(x)+T(y) 

2. T(cx) = cT(x) 

The nullspace Af(T) of a linear transformation T : -A M m is 

A/^T) = {x G f I T(x) = 0 m }. 

The nullity of T is the dimension of J\f(T). 

The range 1Z(T) of a linear transformation T : R n -> R m is 

K(T) = {y G r I y = T(x) for some x G M"}. 

The rank of T is the dimension of 1Z(T ). 

The matrix representation of a linear transformation T : M n —> M m is an m X n matrix A such 
that T (x) = ^4x for all x G M n . 


Summary 


Let T : M n -A M m be a linear transformation. 

T(0 n ) — 0 m , where 0 n and 0 m are the zero vectors in and R m , respectively. 

The matrix representation A of a linear transformation T : M n -A M m is given by 
A = [T(eiT(e n )] , where ei ,. . ., e n are the standard basis for M n . 

If A is the matrix representaiton of a linear transformation T, then 

1. Af(T) = J\T (A) and n(T) = 7 1(A). 

2. The nullity of T is the same as the nullity of A. 

3 . The rank of T is the same as the rank of A. 


Problems 

Let T : R 2 —?► R 3 be a linear transformation such that 

T(ei) = ui and T(e 2 ) = u 2 , 
where {ei, e 2 } is the standard unit vectors of R 2 and 
















Ui 

"5" 

1 

and u 2 — 

"8" 

2 


2 


6 


Then find T 


(a) Solution 1. (Using the Gram-Schmidt process) 

It is straightforward to check that the vectors v 1, v 2 are linearly independent, and hence the set S is a 
basis of Span(5) . 

Since the dot (inner) product of Vi and V2 is 


Vl • V 2 = 1 ^ 0 , 

S is not an orthogonal basis. We apply the Gram-Schmidt process to generate an orthogonal basis from the 
basis S . 

The Gram-Schmidt process for two vectors is as follows. We define vectors Ui, U2 by the following 
formula. Then B — {ui, U2 } is an orthogonal basis of Span (S) . 

ui := vi 

U 1* V 2 /*x 

u 2 :=v 2 -ui. (*) 

ui • Ui 


Since we have 


Ui • v 2 = Vi • v 2 = 1 and ui • ui = vi • vi = 2 , 


we compute 


Therefore the set 


U 2 = V 2 - -Ui 


O' 


T" 

1 

1 

0 

1 

~ 2 

1 

0. 


.0. 


- 1 / 2 ' 

1 

1/2 

0 



'- 1 ' 

1 

2 

" 2 

1 


. 0 . 


























is an orthogonal basis of Span (S) . 

Note that scaling by a nonzero scalar does not change the orthogonality, the set 



is also an orthogonal basis of Span (S) , just in case you prefer not to have a fraction. 

(a) Solution 2. (Using a pattern of the Gram-Schmidt process) 

Here is another solution using a partial information of the Gram-Schmidt process. 

As in Solution 1, the set S is a (non-orthogonal) basis of Span(S') . 

We want to apply the Gram-Schmidt process but suppose we only remember the pattern of the Gram- 
Schmidt process. Namely, we want to define orthogonal vectors u, u 2 by 


Ui := vi 

u 2 := v 2 + CLM\. 


Here a is some number, which is given in the Gram-Schmidt process (*) but we don’t remember. 
We can still determine the number a as follows. Since m and u 2 will be orthogonal, we have 

0 = ui • u 2 = ui • (v 2 + avLi) 

= ui • v 2 + aui • ui 
— 1 ~\~ 2 u. 


Hence, we obtain a — — 1 / 2 . 
Then we determine 


1 

U2 = V 2 - -Ui 


’o' 


"1" 

1 

1 

0 

1 

' 2 

1 

.0. 


.0. 


■-1/2 

1 

1/2 

0 



(So we could complete the Gram-Schmidt process even though we didn’t remember the details.) 
Hence the set 






















is an orthogonal basis of Span (S’) . 

(b) Solution. 

We first express the vector 


Then we compute 

T 


3 

-2 


Therefore we have found 


3 

-2 


as the linear combination 


3 

-2 


= 3 


- 2 


= 3ei — 2eo. 


= T{ 3d - 2e 2 ) 

= 3T( ei ) -2T(e 2 ) 



'5' 


"8" 

3 

1 

- 2 

2 


2 


6 


"15' 


'16' 


'-1" 

3 

— 

4 

= 

-1 

. 6 . 


.12. 


. —6. 




1 

I—* 

' 3 ‘ 

) = 

-1 

.-2. 

) 




.-6. 


by linearity of T 


Let T : M n —> M m be a linear transformation. Let 0 n and 0 m be zero vectors of R n and M m , respectively. 
Show that T(0 n ) — 0 m . 


Steps. 

Express 0/2 — 0^ 0/2 or 0/2 — 0*0/2* 

{ Calculate r(0 w ) using step 1 and the definition of linear transformation. 

I will give three proofs. 

Proof 1. 

Since 0 W = 0^ + 0 W , we have 

T(0 n ) = T{ o n + o„) = r(o„) + r(0„), 

where the second equality follows since T is a linear transformation. 



























Subtracting T(O n ) from both sides of the equality, we obtain 0 m — T(0 n ). 

Note that 0 m — T (0^) — T (0^) since T (0^) is a vector in . 

Proof 2. 

Observe that we have 0-0^ = 0^ . (This is a scalar multiplication of the scalar 0 and the vector 0^ . 
Now we have 


T(0 n ) = T(0 • 0*) = 0 • T(0 n ) = 0 m • 

Here we used the one of the properties of the linear transformation T in the second equality. 
Proof3. 

Note that 0^ = 0^ — 0^ . 

Thus we have 


T(0 n ) - T(0 n - 0„) - T( 0„) - T( 0„) = 0 m , 

where we used the linearity of T in the second equality. 

In the last equality, note that the vector T{ 0 W ) is m -dimensional vector. 


Determine whether the function T : R 2 —>• R 3 defined by 

x+y 

x + 1 

3 y . 

is a linear transformation. 

Solution. 

The function T : 1R 2 — ^ R 3 is a not a linear transformation. 

Recall that every linear transformation must map the zero vector to the zero vector. 
However, we have 




r( 

'o' 

.0. 

)- 

'0 + 0' 
0 + 1 

— 

'O' 

1 


'o' 

0 



3 • 0 


0 


0 


So the function T does not map the zero vector 
Thus, T is not a linear transformation. 




'o' 

'o' 

to the zero vector 

.0. 

0 



.0. 


Another solution 

Another way to see this is, for example, as follows. 
Let 



i' 

and v = 

'o' 

U = 



0 


1 






















(In fact, you may take any two vectors.) 
Then we have 

T(u) + T(v) = T 

On the other hand, we have 






'1 


" 1 " 


"2" 

'1 

)+t( 

" o ' 

) = 

2 

+ 

1 

_ 

3 

.0. 

) \ 

.1. 

) 










.0. 


.3. 


.3. 


T(u + v) = T 


Therefore, we see that 

T(u) + T(v)^T(u + v) ; 
and hence T is not a linear transformation. 


Let T : R 2 —> R 2 be a linear transformation. 
Let 


u — 



3 

5 


be 2-dimensional vectors. Suppose that 


T(u) = T 


T(v) = T 




Let w — 


x 

.y. 


E R 2 • Find the formula for T(w) in terms of x and y. 


Solution. 

Note that the vectors u, v are basis vectors for 1R 2 . 

Thus we can write the vector w as a linear combination of u and v . 
Let 


aiu + tf 2 V = w. 


We want to determine a\ and a 2 . 

So we consider the augmented matrix 


1 3 

X 

2 5 

y _ 


Applying the elementary row operations, we obtain a reduced row echelon form matrix for this matrix as 
follows. (This is the Gauss-Jordan elimination method.) 































1 

3 

X 

R 2 -2Rl 
-> 

1 

3 

X 

2 

5 

7 _ 


0 

-1 

1 

to 

1_ 


1 

3 

X 

R 1 -SR 2 
-> 

1 

0 

x — 3(2x — y) 

0 

1 

1 

1 

<£i 

0 

1 

2 x — y 


Therefore we have 

a\ —x — 3(2x — y) = —5x + 3 y 
<22 = 2x — y 


and the linear combination is 

w = (—5x + 3y)u + (2x — y)v. 

Now we use the linearity of the linear transformation T , we calculate T (w) as follows. 

T{ w) = r((-5x + 3y)u + (2 x - y)v) 

= (-5x + 3j)r(u) + (2 x-y)T(\) 

— (—5x + 3 y) 

15x — 9y + 14x — 7y 
. —25x + 15y + 2x — y 

29x — 16 y 
_ —23x + 14y . 


+ (2x-j) 


Thus the formula is 


r(w) 


29x — 1 6y 
—23x T - 14j 


Comment. 

After you obtained the formula, check the formula with w 


and see if you get 


-3 

5 


If the formula gives a different vector, then you made some mistake. 
To be more confident, do the same with w = 

5 


Let {Vi, V 2 } be a basis of the vector space M 2 , where Vi = ^ 

transformation T : M 2 —>■ M 3 on the basis {vi, V 2 } is given by 


andV2 — 


1 

-1 


. The action of a linear 



"2" 


" 0 ' 

T(v 1 ) = 

4 

and T(v 2 ) — 

8 


_ 6 _ 


10. 


Find the formula of T(x), where x = 


x 

IVi 











































Solution 1 (linear combination) 

Since we know the values of T on the basis vectors v i , v 2 , if we express the vector x as a linear 
combination of v i , V 2 , we can find ^(x) by the linearity of the linear transformation T. 

So let us find the scalars C\ , c 2 such that 


x = Ci v 1 + c 2 v 2 . 


We write this as 


x 

LT 


ci 


+ c 2 


1 

-1 


1 1 

1 -1 


Cl 

LC2 


The matrix 


1 1 

1 -1 


is invertible (as its determinant is —2) and its inverse matrix is 


'1 1 ' 

1 

'1 1 ' 

.1 - 1 . 

= 2 

.1 - 1 . 


Thus, we have 


Cl 


'1 

1 ' 

.C 2 _ 


.1 

-1. 


-1 


1 1 
1 -1 

x + y 
x — y 


Therefore, we obtain the linear combination 


X = ^(x+y)v 1 + -!(. x-y)v 2 ■ 


Now we compute T(x) as follows. 
We have 


T ( x ) = T ( ^(*+y)vi + |(x-j)v 2 


■l(x+j)r(v 1 ) + -t(x- y)T(v 2 ) by linearity of T 


= 2 ( x +y) 


"2" 

1 

■ 0' 

4 

+ 2 ( x ~y) 

8 

.6. 


.10. 


x+y 
6x — 2 y 
_ 8x — 2y 





































Hence the formula is 


r(x) = 


X + J 

6x — 2 \y 
8x — 2 \y 


Solution 2 (Matrix representation) 


In the second solution, we use the matrix representation for the linear transformation T. 


Let A be the matrix of T with respect to the standard basis { 
Thus, we have T (x) = Ax by definition. 

To find the matrix A , we compute 


1 ' 


'o' 

. 0 . 

? 

. 1 . 


} ofM 2 . 


= ^[v i v 2 ] 

= [Av i Ay--, 

2 0 ' 

4 8 

6 10 


It follows that we have 


A = 


We now be able to find T (x) as follows. 
We have 


0 

8 


6 10 


0 

8 


6 10 . 

1 1 

6 -2 

8 -2 


r(x) = Ax 


x+y 
6x — 2 y 
_8x — 2y 


which is, of course, the same formula that we obtained in solution 1. 




























Q Let T: 


i 3 —> M 2 be a linear transformation such that 


T(e i) = 


,r(e2) = 


,r(e 3 ) = 


where 


e l 


1 

T—1 

1 _ 


" 0 " 


o 
_ 1 

0 

5 e 2 = 

1 

5 e 3 

0 

1 

o 

_ i 


. 0 . 


_ 1 _ 


are the standard unit basis vectors 
xi 

For any vector x = cc 2 

_Z3J 


, find a formula for T(x). 


Hint. 

Recall the definition of a linear transformation T : IR 3 —> IR 2 . A map T is a linear transformation if the 
map T satisfies: 

Q T (u + v) = T (u) + T (v) for any u, v G R 3 , and 


T (cv) = cT (v) for any v G 1R 3 and c E R. 


Solution. 

Using the standard unit basis vectors, any vector x = 
combination 


Xi 

X 2 
A3 


G 1R* 5 can be expressed as a linear 


x — x i e i +x 2 e 2 +x 3 e 3 . 

Since T is a linear transformation, we have 

T(x) = T(x iei +x 2 e 2 +x 3 e 3 ) 

= xiT(ei) +x 2 r(e 2 ) +x 3 T(e 3 ) 


Xf 


+ x 2 


+ x 3 


x i + 2 x 2 + 3x 3 
4xi + 5x 2 + 6x 3 


Therefore the formula is given by 


r(x) 


x i + 2x 2 3x 3 
4x i + 5x 2 + 6x 3 


Let T: 


—y 


' be a linear transformation such that it maps the vectors Vi, v 2 as indicated in the figure below. 































Find the matrix representation A of the linear transformation T. 

Solution 1. 


From the figure, we see that 


vi = 


and 


r( Vl ) = 


-3 

1 


and V 2 


and T(v 2 ) = 


Let A be the matrix representation of the linear transformation T. By definition, we have 
T(x) = Ax for any x € M 2 . 


We determine A as follows. 
We have 


2 1 
2 3 


= [T( Vl ),r(v 2 )] 

= [Avi,Av 2 ] 

= A[vi,v 2 ] 

-a'-® 5 

1 2 


Note that the determinant of the leftmost matrix is —11, hence it is invertible and the inverse is given 
by 


'-3 5' 

1 

'-2 

5' 

. 1 2. 

11 

. 1 

3. 


Hence 























2 

.2 

11 

l_ 

11 



-3 5 
1 2 



13 

19 


-l 


5 

3 


Therefore, the matrix representation of T is 

„ 1 [-3 131 

■ 4 = n[-i i.J- 

Solution 2. 

2 

Let {ei, e 2 } be the standard basis for R . 

Then the matrix representation A of the linear transformation T is given by 


A=[T(e 1 ),T(e 2 )]. 


From the figure, we see that 


vi = 


and 


TM = 


-3 

1 


and V 2 = 


and T(v 2 ) = 


The values of T(ei ) and T(e 2 ) are not indicated in the figure but we can determine these values as 
follows. 

Note that { Vi, V 2 } is a basis for R 2 (as they are linearly independent from the figure). 

Let us express ei as a linear combination of v — 1 and V 2 . 

Let 


ei = civi + c 2 v 2 , 

where C\ , C 2 are scalars to be determined. 


This is equivalent to 


As we have 


it follows that 


1 

0 


[vi,v 2 ] 


Cl 

.C 2 _ 


'-3 5' 


Cl' 

. 1 2. 


.c 2 . 


'-3 5' 

1 

'-2 

5' 

. 1 2. 

11 

. 1 

3. 


































Cl 


'-3 5' 

-1 

'1' 

1 

'-2 5" 


'1' 

_C2_ 


. 1 2. 


.0. 

" 11 

. 1 3. 


.0. 


Thus, we obtain the linear combination 


2 1 

ei = “H Vl+ n V2 ’ 


By the linearity of T, we have 


T(e,) = -^-r(v 1 ) + T.r(v 2 ) 


'2' 

1 

+ tt 

i' 

.2. 

n 

.3. 

-3 



-1. 




Similarly, let 


e 2 = rfivi + d 2 ~v 2 , 


where di , d 2 are scalars to be determined. 
Then we have 


'O' 


-3 5' 

'df 

.1. 


. 1 2. 

.d 2 . 


It follows that 


di 


-3 5' 

-1 

'O' 

1 

'-2 5' 


O' 

_d2. 


. 1 2. 


.1. 

= 11 

. 1 3. 


.1. 


Thus, we have the linear combination 


5 3 

62 = TT Vl + n V2 ’ 


and by linearity of T we obtain 


r(e2) = n r(vi)+ n r(v2) 


_5_, 

11 

_5_ 

11 

11 


2 

2 

13 

19 


' 11 


3 

+ —— 


11 

.3. 


In conclusion, the matrix representation for T is 


J_ |j -2 

IT 1 


T 5 

11 3 











































A=[T(e 1 ),T(e 2 )] = 


11 


0 Let T be a linear transformation from M 3 to M 2 such that 



"o' 

\ 

" 1 

( 

"0" 

i 

1 

.0. 

- 

1 

2 

and T 1 

1 

_ 1 _ 


Then find T 


(The Ohio State University) 


Solution. 


We first express the vector 


as a linear combination 


"o' 


"o' 


i 

= Cl 

l 

+ C 2 

.2. 


.0. 



Then we find that c\ — — 1 and C 2 = 2. Hence we obtain 


"O' 


"O' 


"O' 

1 

= — 

1 

+ 2 

1 

_ 2 _ 


. 0 . 


_1_ 


We now compute 




3 13 
1 19 



by linearity of T 


Therefore we have found that 



















































^ Let T : 


—y 


' be a linear transformation such that 


T(e i) = 


,r(e 2 ) = 


,r(e 3 ) = 


where ei, e 2 , e 3 are the standard basis 
Then find the rank and the nullity of T. 
(The Ohio State University) 


Solution. 


The matrix representation of the linear transformation T is given by 

A = [T( ei ),T(e 2 ),T(e 3 )] = 


1 0 1 
0 10 


Note that the rank and nullity of T are the same as the rank and nullity of A. 
The matrix A is already in reduced row echelon form. 

Thus, the rank of A is 2 because there are two nonzero rows. 


Another way to see this is to use the leading 1 method. It implies that the first two columns vectors 
form a basis of the range of A because the first two columns contain the leading 1 ’s. 

Thus, the rank (=the dimension of the range) is 2. 


The rank-nullity theorem says that 

rank of A + nullity of A — 3 (the number of columns of A ). 
Hence the nullity of A is 1. 

In summary, the rank of T is 2, and the nullity of T is 1. 


Let T : R 2 —> R 3 be a linear transformation such that 



(a) Find the matrix representation of T (with respect to the standard basis for R 2 ). 

(b) Determine the rank and nullity of T. 

(The Ohio State University) 


Solution. 

(a) Find the matrix representation of T . 


The matrix representation A of the linear transformation is given by 


a = [r( ei ),r(e 2 )], 



















where 


ei 


5^2 


are the standard basis vectors for ] 


To determine T (e i ), we first express e i as a linear combination of 
Let 


and 


as follows. 


ei 


= a 


This can be written as 


'3 

4" 


a 


'1 

.2 

3. 


b_ 


.0. 


The determinant of the coefficient matrix 
Hence we have 


3 4 
2 3 


is 3-3 — 4 • 2 = 1 0 and thus it is invertible. 


a 


'3 4' 

-1 

1 ' 


' 3 -4' 


1' 


' 3 ' 

_b_ 


.2 3. 


.0. 


.-2 3 . 


.0. 


.-2. 


Hence a — 3 and b — —2. 
It yields that 


ei = 3 


- 2 


It follows that 


r(ei) = r( 3 
= 3 T 


4" 

3. 

2 T 



' 1 " 


' 0 ' 


' 3 ' 

3 

2 

- 2 

-5 

= 

16 


3 


1 


7 


by linearity of T 


Similarly, we compute T(e 2 ) as follows. 
Let 

e2 = 


= c 


-\- d 


This can be written as 




























































'3 

4" 


a 


'O' 

.2 

3. 


b_ 


.1. 


Hence, we obtain 


C 


'3 4' 

-1 

'O' 


' 3 -4' 


'O' 


'-4' 

d_ 


.2 3. 


.1. 


.-2 3 . 


.1. 


. 3 _ 


and c — —4, d — 3. 
Hence 


e2 


and we have 


r(e 2 ) = T -4 


= -AT 


= -4 


= -4 


+ 3r 


" 1 " 


' 0 ' 


' -4 ' 

2 

+ 3 

-5 

= 

-23 

.3. 


. 1 . 


. -9 . 


by linearity of T 


Therefore the matrix representation A of T is 

A = [7’( ei ),7’(e 2 )] 

(b) Determine the rank and nullity of T. 


3 -4 

16 -23 
7 -9 


Note that the rank and nullity of T are the same as the those of the matrix representation A of T . 
In part (a), we obtained 


A = 


3 -4 

16 -23 
7 -9 


Let us first determine the rank of A . 
We have 


A = 


' 3 

16 

-4 ' 

-23 

*3-2*1 
-> 

' 3 

16 

-4 ' 

-23 

*1^*3 
-* 

" 1 

16 

-1 ' 

-23 

. 7 

-9 . 


. 1 

-1 . 


. 3 

—4 . 


* 2 - 16*1 


* 3 - 3*1 


'1 

- 1 " 

* 3 

'1 

-l 

*1 +*3 

'1 

O' 

R 2 O* 3 

'1 

o' 

0 

-7 

-» 

0 

-7 

-> 

0 

0 

-> 

0 

1 

.0 

- 1 . 


.0 

1 . 


.0 

1 . 


.0 

0 . 


Hence, the reduced row echelon form matrix of A has two nonzero rows. 
































































So the rank of A is 2. 


By the rank-nullity theorem, we know that 

(rank of A) + (nullity of A) = 2. 
As the rank of A is 2, we see that the nullity of A is 0. 


$ 


Define the map T : R 2 —> R 3 by T 



xi - x 2 

X\ + x 2 


(a) Show that T is a linear transformation. 

(b) Find a matrix A such that T(x) = Ax for each x G R 2 . 

(c) Describe the null space (kernel) and the range of T and give the rank and the nullity of T. 


Proof. 


(a) Show that T is a linear transformation. 

2 3 

To show that T : R -G K is a linear transformation, the map T needs to satisfy: 


(i) T(u + v) = T(u) + T(v) for any u, v E 1R 2 , and 

(ii) T(cv) = cT(v ) for any v G M 2 and cGK. 


To check (i), let 

~ui 

u = 

_ u 2 

We have 


(ui + Vl) - (u 2 + v 2 ) 

(Ui + Vi) + (u 2 + V 2 ) 

U2 + V 2 

Vi ~ V2 
Vi + V 2 

V2 

= T(u) + T(v). 


{ui - u 2 ) + (yi - v 2 ) 


Ui - u 2 


(ui + u 2 ) + (vi + v 2 ) 

= 

Ui + u 2 

+ 

U 2 +V 2 


_ U 2 _ 



T( u + v) = T 


Ui + Vl 
U2 + V2 



Thus condition (i) holds. 


To check (ii), consider any v 
Then we have 


Vl 

V2. 


and c G R. 






















T(cv) —T 



CV 1 — CV 2 
OTi + OT2 
CV 2 


= C 


Vl - ^2 
Vl + ^2 
V2 


= cT(v). 


2 3 

Thus condition (ii) is met and the map T is a linear transformation from R to R . 

(b) Find a matrix A such that T (x) = Ax for each xfR 2 . 

(b) Let 

0" 

1 . 

be the standard basis vectors of R 2 . 

The matrix A satisfying T (x) = Ax can be obtained as 

A = fr(ei) r(e 2 )]. 



Since we have 


r(ei) = 

" 1 " 

1 

and T(e 2 ) = 

‘-I' 

1 


0 


1 


we obtain the matrix 


A = 


1 

1 

0 


-1 

1 

1 


(c) Describe the null space (kernel) and the range of T and give the rank and the nullity of T 


Null Space and Nullity 

We fist find the null space of the linear transformation of T. 

Note that the null space of T is the same as the null space of the matrix A. 

By definition, the null space is 

M{T) = Af(A) = {x G R 2 I Ax = 0}. 

So the null space is a set of all solutions for the system Ax = 0. 

To find the solution of the system we consider the augmented matrix and reduce the matrix using the 
elementary row operations. (The Gauss-Jordan elimination method.) 

We have 



















" 1 

-1 

o 

i?2 — Rl 
-» 

" 1 

-1 

0 " 

1 

1 

0 

o 

2 

0 

0 

1 

0 


0 

1 

0 


R\ +i?3 

’ l 

0 

o " 

i?2 —2Ri 


- » 

0 

l 

0 

theni ?2 <5-^3 

0 

0 

0 


Thus the system has only zero solution 


X\ = 0, X2 = 0. 


Therefore we obtained 


AT(T) = AT (A) 


0 

0 


Since the nullity is the dimension of the null space, we see that the nullity of T is 0 since the 
dimension of the zero vector space is 0. 


Range and Rank 

Next, we find the range of T. Note that the range of the linear transformation T is the same as the 
range of the matrix A. 

We describe the range by giving its basis. 


The range of A is the columns space of A. Thus it is spanned by columns 


"l" 


'-i‘ 

l 

? 

l 

.0. 


. l . 


From the above reduction of the augmented matrix, we see that these vectors are linearly independent, 
thus a basis for the range. (Basically, this is the eading 1 method.) 

Hence we have 


and 


K(T) = K(A) = Span { 


/ 

" 1 " 


"- 1 " 

1 

< 

1 

5 

1 

\ 

< 

. 0 . 


. 1 . 

1 


/ 

" 1 " 



\ 

< 

1 

? 

1 

> 

< 

. 0 . 


. 1 _ 

> 


is a basis for 7 Z(T). 


The rank of T is the dimension of the range TZ{T). 
Thus the rank of T is 2. 


Remark that we obtained that the nullity of T is 0 and the rank of T is 2. This agrees with the rank- 
nullity theorem 































(rank of T ) + (nullity of T) — 2. 


Let T : R 4 —>■ R 3 be a linear transformation defined 


by 


T 



Xi 

\ 



£2 




£3 




_ X 4 _ 

) 



+ 2x 2 + 3x 3 - x 4 
+ 5 x 2 Hr 8 x 3 — 2x 4 
xi + x 2 + 2x 3 


(a) Find a matrix A such that T(x) = Ax. 

(b) Find a basis for the null space of T. 

(c) Find the rank of the linear transformation T. 
(The Ohio State University) 


Solution. 

(a) A matrix representation of a linear transformation 

Let e 1? e 2 , e 3 , and e 4 be the standard 4-dimensional unit basis vectors for R 4 . 

Then the matrix representation for the linear transformation is given by the formula 




T(e 3 ) T(e 4 )] 


Then we calculate 



■1 1 

\ 


0 



0 



0 

■ ■ 

/ 


"1 ' 

3 

1 


Similarly, we compute and obtain 



"2 ■ 


.3 . 


"-1 1 

r(e 2 ) = 

5 

1 

, ne 3 ) = 

8 

2 

, T(t 4 ) = 

-2 

0 


Therefore the matrix A we are looking for is 


A = 


'I 

3 

1 


2 3 -1 1 

5 8-2 

1 2 0 


(b) The null space of the linear transformation 

First, note that the null space of a linear transformation T is the same as the null space of the matrix A 
that represents T. 

Thus, we find a basis for the null space of the matrix A we obtained in (a). 






















The null space of A consists of the solutions of Ax = 0 and so we apply Gauss-Jordan elimination 
process to solve the linear equation as follows. We apply the elementary row operations to the 
augmented matrix [A | 0] of A. 



'1 2 

: 3 


-1 

0- 

R 2 - 3 R \ 

■1 

2 

3 

-1 

O' 


1/4 10] = 

3 5 

; 8 


-2 

0 



0 - 

-1 

-1 

1 

0 



1 1 

2 


0 

0 

r 3 -r 

1 

0 - 

-1 

-1 

1 

0 


*3- 

-r 2 

■i 

2 

3 

-1 0- 

Ri 

- 2 R 2 

' 1 

0 

1 

1 

0- 


0 

1 

1 

-1 0 



0 

1 

1 

-1 

0 

then - R 2 

0 

0 

0 

■ 

0 

0 



0 

0 

0 

0 

0 


Therefore the solutions are 


X| = - - x 4 

X 2 = - X 3 + x 4 


and thus the solution x of Ax = 0 is of the form 


-■ 

1 

X 

u> 

1 

X 

_■ 


‘-1 1 


■-1 ■ 

-X3+X4 


-1 


1 

x 3 

= X 3 

1 

+ X 4 

0 

x 4 

1 1 


0 

1 1 


1 

■ ■ 


Hence we have 


MT) = MA) = 



and the set 



is a spanning set for the null space M(T). 

We claim that the set B is linearly independent. Indeed, if we have a linear combination 


























■-1 1 


'-I 1 


-1 


1 

a x 

1 

+ a 2 

0 


0 

■ ■ 


1 

■ ■ 


then we have 


1 ~a x - a 2 
-a x +a 2 


a 2 


= 0 


and thus a x = a 2 = 0 and the set B is linearly independent. 

Since B is a linearly independent spanning set for the null space Af(T), the set B is a basis for Af(T). 
In summary, we found a basis 



(c) The rank of the linear transformation 

We apply the rank-nullity theorem to find the rank of T. 

From part (b), we see that nullity, the dimension of the null space Af(T), is 2. 

The linear transformation is a map from R 4 to R 3 . Thus the rank-nullity theorem states that we have 

rank of T + nullity of T = 4 
rank of T + 2 = 4. 


Therefore the rank of the linear transformation T is 2. 


0Let T 

e i 


: R 2 —y R 3 be a linear transformation such that T(ei) 



0 

1 


are unit vectors of R 


2 


and 


Ui andT(e2) = u 2 , where 



"-1" 


"2" 

Ui 

0 

, U 2 = 

1 


_ 1 _ 


_ 0 _ 


Then find T 




A linear transformation from a vector space V to a vector space W is a map / : V -A W satisfying 
the following linearity properties: 























0 f(u + v) = /(u) + f(v) for any vectors u, v G V, and 


0 f(rv) = rf(y) for any vector v £ V and any scalar 


r. 


Note that the set {ei, e 2 } is a basis for the vector space R 2 . 

r 3 1 

Thus the vector can be written as a linear combination of the basis vectors ei, e 2 . 

2 

Solution. 

We first express the vector 


3 

-2 


Then we have 


3 

-2 


as a linear combination of ei and e 2 : 

3 ‘ 

= 3ei — 2e2- 

2 


— T(3ei — 2e2) 

= 3T(ei) — 2T(e2) by the linearity of T 

— 3ui — 2 u2 



‘-I' 


"2" 

3 

0 

- 2 

1 


1 


0 


-7' 

-2 

3 


Thus, we found 


3 

-2 


-7' 

-2 

3 


Let B — {Vi, V 2 } be a basis for the vector space M 2 , and let T : M 2 —M 2 be a linear transformation such that 

1 " 

—2 

If e : = Vi + 2 v 2 and e 2 = 2vi — u 2 , where ei, 
of T with respect to the basis {ei, e2 }• 

Solution. 



r(vi) = 


andT(v 2 )= ^ . 

e 2 are the standard unit vectors in M 2 , then find the matrix 


The matrix representation A of the linear transformation T with respect to the basis {e^, e 2 } is 
given by 


^ = [T( ei ) T(e 2 )]. 


(*) 

























To find the vectors T(ei),T(e 2 ) we use the linearity of T. 
We have 


T(ei) = T(v i + 2v 2 ) 

= T(vi) + 2T(v 2 ) (by the linearity of T) 

7" 

0 



Also we have 


r(ea) 


T(2vi - u 2 ) 
2T(vi) - T(u 2 ) 



3 

1 


Therefore the matrix A — [ T(ei ) T(e 2 ) ] of the linear transformation T is given by 


A = 


7 -1 
0 -5 


£| Let T : 


be a linear transformation such that 


,T 


Then find the matrix A such that T(x) = Ax for every x E R 2 , and find the rank and nullity of T. 
(The Ohio State University) 


Solution. 

The matrix A such that T (x) = Ax is given by 

A=[T(e 1 ),T(e 2 )}, 


where ei = 


> e 2 


are standard basis of . 


Since the vector T(e 2 ) is given, it remains to find T(ei). 
By inspection, we obtain the linear combination 


1 


1 


0 

0 


1 


1 


Thus, we have 
































by linearity of T 


It follows that the matrix A is given by 

A = 

The rank and nullity of T are the same as the rank and nullity of A. 

We reduced the matrix A by elementary row operations as follows: 

0 
1 


~\~Rl 

'1 

3' 


'1 

3' 

R\ — 3i?2 

-> 

.0 

5. 

-* 

.0 

1. 

- > 


1 3 

-1 2 


Hence the rank of A is 2 (because there are two non zero rows). The nullity of A is determined by 
the rank nullity theorem 

rank of A + nullity of A = 2 (the number of columns of A). 

Hence the nullity of A is 0. 

In a nutshell, the rank of T is 2, and the nullity of T is 0. 




If L : 


is a linear transformation such that 



then 

(a) find L 



, and 


(b) find the formula for L 
{Purdue University) 


X 

y _ 



Express 


1 

2 



'i 


'i' 

as a linear combination of 

0 

and 

i 













































m 

$ 


Use the linearity of linear transformation L . 


Same for part (b). Replace 


1 

2 


with the general vector 



Solution. 



Note that the vectors 


and 


are a basis of Kl 


We first express 

1 


Let 


= ci 


as a linear combination of the vectors 


+ C 2 


Cl + C 2 
C2 


Solving this, we have c\ — — 1 and c 2 — 2. 


Then we calculate 

L 


= L - 


+ 2 


= -L 


"1" 


"2" 


'3' 

1 

+ 2 

3 

= 

5 

_2_ 


_2_ 


_2_ 


and 


+ 2 L 


where the second equality follows from the linearity of L . 
Thus we have 


(b) Find the formula for L 


We generalize the proof of (a). 


X 

LJ 7 


Let 
Ci ~- 


ci 


C2 


y J 

=y. 


Cl + c 2 
C2 


be a linear combination. Solving this, we have 


Hence the linear combination is 


x 

iy 


= (x-y) 


+y 


Using the linearity of L , we compute 
































































X 

TJ 


= Ll (x-y) 


+y 


= (• x-y)L 



l 


2 


x+y 

(x-y) 

l 

.2. 

+y 

3 

. 2 . 

— 

x+ 2y 

2x 


+ yL 


Therefore the formula is 


x 

LTJ 


x+y 
x + 2y 
2x 


Suppose that T : 


—y . 


p3 , 


is a linear transformation satisfying 




"3" 




"0" 

"1" 

) = 

4 

and T \ 

'O' 

) = 

0 

2 

j 


v 

.1. 

J 



_5_ 




.1. 


Find a general formula for T 
(The Ohio State University) 


xi 

L%2 J 


Solution 1. (Using linear combination) 


Note that the set B := 


form a basis of the vector space . 


To find a general formula, we first express the vector 


Xi 

X2 


as a linear combination of the basis 


vectors in B. 

Namely, we find scalars C\ , C 2 satisfying 


Xi 

X2 



This can be written as the matrix equation 


~ Xi 

= p 

~Ci~ 

-%2_ 


_C2_ 


where we have put 


2 1 . 

The 2x2 matrix P is invertible as its determinant is det(P) = (1)(1) — (0)(2) = 1 7 ^ 0, and 
the inverse matrix is given by 


P~ l 


1 0 

-2 1 


Then we multiply P 1 on the left of (*) and obtain 



















































Cl 

C2 


= P 


-1 


Xi 

X 2 



"xi" 


_ x 2 _ 


1 0 

-2 1 

Xi 

-2xi + X 2 

Thus, we have determined the scalars 

ci = x\ and C 2 = — 2xi + # 2 , 
and the linear combination becomes 


Xi 

X2 


= Xl 


+ (- 2^1 + X 2 ) 


Now we compute T 


xi 

X2 


as follows. 


xi 

X2 


= T^x i 

= x\T 


- (-2xi + x 2 ) 

+ (— 2 xi + x 2 )T 



3 


'O' 

Xl 

4 

.5. 

+ (-2a:i + x 2 ) 

0 

_1_ 


3 xi 

4xi 

3 xi + x 2 


by linearity of T 


Therefore, the general formula is given by 



3xi 

4xi 

3 xi + x 2 


Solution 2. (Using the matrix representation of the linear transformation) 

The second solution uses the matrix representation of the linear transformation T. 

Let A be the matrix for the linear transformation T . 

Then by definition, we have 

T(x) = Ax, (**) 

for every xGK 2 . 

(Note that the size of A is 3 X 2 because T : R 2 —R 3 .) 








































We determine the matrix A as follows. 
We compute 


A 


1 

2 


0 

1 



3 0 

4 0 

5 1 


by (**) 


The 2x2 matrix on the left hand side is invertible, and the inverse is 


'1 o' 

-1 

' 1 

O' 

.2 1 . 


i 

1 

to 

1 . 


(Remark: This is the matrix P in the first solution.) 


Multiplying by this inverse matrix on the right, we obtain 


A 


3 0 

4 0 

5 1 

3 0 

4 0 

5 1 

3 0 

4 0 
3 1 


1 

2 


0 

1 


-l 


1 0 

-2 1 


Now that we have obtained the matrix A for T, we can find the general formula as follows. 
We have 


T 



= A 


Xl 

-X 2 _ 


3 

4 
3 


Xl 

X2 


by (**) 


3cci 

4a?i 

Zxi + X2 


In summary, the general formula is 











































3xi 
4x i 

3a:i + x 2 


Let T be the linear transformation from the 3-dimensional vector space R 3 to R 3 itself satisfying the following 
relations. 


0 

2 

-1 


Then for any vector x = 


E R, find the formula for T(x) 


Solution 1 using the matrix representation. 

The first solution uses the matrix representation of T. 

Let A be the matrix representation of the linear transformation T with respect to the standard basis of 

M 3 . 

Then we have T(x) = Ax. by definition. 

We determine the matrix A as follows. 

We have 


"1 

2 

O' 



"T 


'2' 


'O' 


1 

3 

1 

= 

A 

l 

, A 

3 

, A 

1 


.1 

5 

2. 



.1. 


.5. 


.2. 

_ 






T' 


'2" 


'O' 

- 




= 

T 

l 

, T 

3 

, T 

1 







.1. 


.5. 


.2. 



0 

2 


It follows that we have 

A 

Let us find the inverse matrix by 


T o i' 


'1 2 O' 

0 2 0 


13 1 

.1 -1 0. 


.1 5 2. 


using augmented matrix. 












































Ri — 2R2 
R s - 3 R 2 


1 

0 


0 

1 


0 0 -] 


Rl + 2 i ?3 

R2 —R3 


10 0 
0 10 
0 0 1 


" 1 

2 0 

1 

O 

O 

R2 

-Ri 

’ 1 

2 

0 



R3 

—Ri 



1 

3 1 

0 

1 0 


- > 

0 

1 

1 

1 

5 2 

0 

0 

1 



0 

3 

2 

-2 

3 

-2 

0 " 


-R3 

" 1 

0 

-2 

r 

1 

-1 

1 

0 


—» 

0 

1 

1 

-i 

-1 

2 

-3 

1 



0 

0 

1 

r 


10 0 
-110 
-10 1 

-2 0 
1 0 
3 -1 


-1 4 -2 

1-2 1 

-2 3 -1 


Thus we obtain the inverse matrix 


and hence we have 


A = 


"1 2 O' 

-1 

'-1 4 -2' 

13 1 

= 

1-2 1 

1 5 2_ 


.-2 3 -1. 


'1 0 r 


'-1 4 -2" 


'-3 7 -3' 

0 2 0 


1-2 1 

= 

2-4 2 

.1 -1 0. 


.-2 3 -1. 


.-2 6 -3. 


Using the relation T (x) = Ax, the formula for T (x) is obtained as follows. 

T(x) = Ax 

-3 7 -3' 

2-4 2 

-2 6 -3 


-3x + 7y — 3z 
2a; — 4 y + 2 z 
_— 2x + 6 y — 3z 



X 


y 


_z _ 


Solution 2 using a linear combination and linearity. 


The second method is to find the linear combination 


X = Cl 

'1" 

1 

+ C2 

"2" 

3 

+ c 3 

'O' 

1 


1 


5 


2 


and use the linearity of the linear transformation. 

To find the coefficients Ci, C 2 , C 3 , we consider the augmented matrix 


’ 1 

2 

0 

X 

1 

3 

1 

y 

1 

5 

2 

z 


and we reduce this matrix by elementary row operations. 

The reduction operations are exactly the same as in solution 1 and we obtain 














































ci = —x + 4y — 2z 
C 2 — x — 2 y + z 
C 3 = —2x + 3 y — z. 


Therefore, we have by the linearity of T 
T(x) = T (a 
( 


+ C2 


= c x T 


1 
1 

VL1. 


+ c 2 T 


+ c 3 


+ c 3 T 


/ r° 

1 

vU 





' 0 ‘ 


(—X + 4 y — 2 z) 

0 

+ (x — 2y + z) 

2 

+ (-2x + 3 y — z) 


1 


-1 



—Sx + 7y — 3z 
2x — 4y + 2 z 
—2x + 6y — 3z 


0 Let T : M. 2 M 3 be a linear transformation given by 

x 1 - x 2 
X2 

x 1 + £2 

Find an orthonormal basis of the range of T. 

(The Ohio State University) 



Solution. 

Let A be the matrix representation of the linear transformation of T . 
That is, 


A = [T{e l ) r(e 2 )], 


where 


0 
1 

form the standard basis of the vector space R 2 . 

By the formula, we see that 




and thus the matrix A for T is 









































A = 


1 -1 

0 1 

1 1 


Note that the range of T is the same as the range of A . 

We reduce the matrix A by the elementary row operations as follows: 


A = 

'1 

0 

- 1 " 

1 

1 

CO 

"1 

0 

- 1 " 

1 

R i +i?2 
-> 

"1 

0 

o' 

1 


1 

1 


0 

0 


0 

0 


Since the both columns contain the leading 1 ’s, we conclude that 






-1 

1 

1 


is a basis of the range of A by the leading 1 method. 

Note that the dot product of these basis vectors is 0, hence they are already orthogonal. 

Hence to obtain an orthonormal basis, we just need to normalize the length of these vectors to 1. 


In summary, an orthonormal basis of the range of T is 



1 

’v/3 


-1 

1 

1 


. 


This problem is one of the final exam problems of Linear Algebra course at the Ohio State University 
(Math 2568). 


€> Let T : M n —>• M m be a linear transformation. Suppose that S = {xi, x 2 ,..., x*. } is a subset of M n such that 
{T(xi), T(x 2 ), ..., T(x&)} is a linearly independent subset of M m . Prove that the set S is linearly 
independent. 


Hint. 

Recall the following definitions. 

^ Vectors xi,X 2 ,...,X£ are linearly independent 
if and only if the only solution to the vector equation 

Cixi + c 2 x 2 H-b CkX k = 0„ 

is Ci — c 2 = • • • = Ck — 0. 


O A linear transformation T : M” —>■ R m is a map such that 
^ r(v + w) = T (v) + w for all v, w G M” . 























^ T (cv) = cT (v) for all c E R and v G R 71 . 


Proof. 

Consider a linear combination of vectors in S 

C 1 X 1 + c 2 x 2 H-b c k x k = 0„, 

where 0^ is the n dimensional zero vector. 

To show that S is linearly independent, we need to show that the coefficients C\ are all zero. 

Recall that any linear transformation maps the zero vector to the zero vector. 

(See A linear transformation maps the zero vector to the zero vector for a proof of this fact.) 

Thus we have 

0 m = T( 0„) = r(cixi + c 2 x 2 4-h c k x k ) 

— ciT(xi) + c 2 r(x 2 ) 4-h c k T(x k ). 


In the last step, we used the linearity of T . 

Since the vectors 7 1 (xi),7 1 (x 2 ),...,7 1 (x^) are linearly independent, the coefficient of this linear 
combination of these vectors must be zero. 

Thus we have C\ — c 2 = • • • = c k — 0 , hence the set S is linearly independent. 


Let T be a linear transformation from the vector space M 3 to M 3 . 

Suppose that k — 3 is the smallest positive integer such that T k — 0 (the zero linear transformation) and suppose 
that we have x E R 3 such that T 2 x ^ 0. Show that the vectors x, Tx, T 2 x form a basis for R 3 . 

(The Ohio State University) 


Proof. 

We first recall that 7"(0) = 0 . To see this, we compute 

T( 0) = r(0.0) = 0 • T( 0) = 0. 

To show that x, 7"x, 7" 2 x form a basis, it suffices to prove that these vectors are linearly independent 

o 

since the dimension of the vector space R is three and any three linearly independent vectors form a basis 
in such a vector space. 

(For a different proof, see the post A linear transformation maps the zero vector to the zero vector.) 


Let us consider a linear combination 

aix + a 2 (rx) + a 3 (7’ 2 x) = 0, 


(*) 



where a \, a 2 , a 3 are real scalars. 


If the only solution is a\ = a 2 — a 3 = 0, then the vectors are linearly independent. 


Applying the linear transformation T on both sides, we obtain 

T (aix + «2(rx) + a3(r 2 x)) = 7"(0) = 0. 

Using the linearity of T , we have 

ai(Tx) + a 2 (T 2 x) + a 3 (T 3 x) = 0 


and since T 3 = 0, we have 


a\{Tx) + a 2 {T 2 x) — 0 




r 


We apply the linear transformation T again, and by using the linearity as above we obtain 

ai(r 2 x) + a 2 (T 3 x) = T( 0) = 0 


and since T 3 = 0, we have 


ai (T 2 x) = 0. 


Since T 2 x ^ 0 , we must have a\ — 0. 


Then from (**), we have a 2 {T 2 x ) = 0 , and hence a 2 — 0 since f 2 x/ 0 . 

Substituting a\ — a 2 — 0 in (*), we have a 3 {T 2 x ) = 0 , and thus a 3 — 0 since f 2 x 7^ 0 . 

Therefore the solution to (*) is a\ — a 2 — a 3 = 0, and hence the vectors 

x, Tx, T 2 x 


are linearly independent. 

We conclude, hence, that these vectors form a basis for IR 3 . 


Let n be a positive integer. Let T : M n —>• R be a non-zero linear transformation. Prove the followings. 

(a) The nullity of T is n — 1. That is, the dimension of the nullspace of T is n — 1. 

(b) Let B — {vi, • • •, v n _i} be a basis of the nullspace Af(T) of T. Let w be the n-dimensional vector that is 
not in Af(T). Then B' — {vi, • • •, v n _i, w} is a basis of M n . 

(c) Each vector u E M n can be expressed as 


u = v + 


n u) 

—7—r w 

n w) 


for some vector v E Af(T). 



Solution. 

(a) The nullity of T is n — 1. 

Let A be the matrix representation of the linear transformation T : WL n —> R. 

Then A is a 1 X n matrix and nonzero since T is a non-zero linear transformation. 

Thus, the rank of A is 1. So the rank of T is 1. 

Then the rank-nullity theorem 

rank of T + nullity of T — n 
yields that the nullity of T is n — 1. 

(b) B' is a basis of 1BL n . 

We claim that the n vectors v i, • • •, v n -i , w are linearly independent. Suppose that we have 

civi H-b +c n w = 0, 

for some c \,..., c n £ R . 

Observe that if c n ^ 0, then we have 

Cl Cn —l 

W = -Vi H-1-v„_i, 

Cn C n 

and thus w £ Span (B) —M{T ) , a contradiction. 

Hence c n — 0. 

Then the linear combination becomes 

civi H-h c«-iv«-i = 0, 

and since B — {v i, • • •, v n -i } is linearly independent (as being a basis), we must have 
ci = • • • = c n -1 = 0. 

As a result, all Cf are zero. So the vectors v i, • • •, v n -i , w are linearly independent. 

Since W 1 is an n -dimensional vector space and B 1 consists of n linearly independent vectors, the set B 1 
is a basis of WL n . 

(c) Each u £ W 1 can be expressed as u = v + w . 

r(wj 

Let u £ WL n . Since B 1 isa basis of R”, we can write 

u = CiVi H-b C n —\ V n —\ + C n W, 

for some C \,..., c n £ R. 

Let 

u = Civi H-+ c n - iv„_i + c„w. 






Then v is in J\T{T ) and we have 


U = V + c n w. 


n 


We determine c n as follows. 

Applying the linear transformation T , we obtain 

r(u) = T(v + c n w) 

— T (v) + c n T (w) by linearity of T 

= 0 + c„r(w) since v G Af(T) 

= c n T( w). 


Note that since w ^ A/"( T ), the real number T (w) is not zero. 
Hence, we obtain 

_ 

T(w) • 


Combining this with (*), it follows that 


with v G Af(T) . 
\end{ enumerate} 
\end {proof} 


u = v 


r(u) 

T(w) 


w 


Let V be the subspace of M 4 defined by the equation 

X\ — X 2 + 2 x 3 + 6 x 4 = 0 . 

Find a linear transformation T from M 3 to M 4 such that the null space A f{T) — {0} and the range 7 Z(T) = V. 
Describe T by its matrix A. 

Solution. 

Any vector 

X2 ^ T/ 

x = G V 

_ X4 _ 

can be written as 






X2 ~ 2x% ~ 6 X 4 
X2 
X 3 

X 4 


V 


'-2" 


'-6' 

1 

0 

+ Xz 

0 

+ X 4 

0 

0 

1 

.0. 


. 0 . 


. 1 . 


Let Vi, V 2 , V 3 be the vectors appearing in the above linear combination of x. 
Then it is straightforward to see that the set B — { Vi , V2 , V3 } is a basis of V. 

We define the linear transformation T : R 3 -A R 4 by 


T(x) = Ax, 


where 

1 -2 - 6 " 

1 0 0 

0 10’ 

0 0 1 _ 

Since B is a basis of V, in particular it is a linearly independent set. Thus, the columns of A is 
linearly independent. 

It follows that the null space Af(T ) = Af(A) = {0}. 

Also, the range of T is the same as the range of A, which is spanned by the columns of A. 
Thus, the range H(T) — Span (B) — V. 

By our definition of T, the matrix representation of T is A. 


A = [vi,v 2 ,v 3 ] = 


A more explicit formula for T is given by 






X\ — 2x2 6^3 




Xi 


X 2 


X 2 


_ X 3 _ 


X 3 


$ 


Let u — 


and T : M 3 —M 3 be the linear transformation 

T(*)=p,oj„ X =(^)u. 


(a) Calculate the null space M(T ), a basis for M(T) and nullity of T. 

(b) Only by using part (a) and no other calculations, find det(A), where A is the matrix representation of T with 
respect to the standard basis of M 3 . 

(c) Calculate the range 1Z(T), a basis for 1Z(T) and the rank of T. 

(d) Calculate the matrix A representing T with respect to the standard basis for M 3 . 























(e) Let B = 


to B. 


/ 

' 1 ' 


"- 1 " 


" 0 " 



X 

< 

0 

5 

1 

5 

-1 


> be a basis for M 3 . Calculate the coordinates of 

y 


.0. 


_ 0 _ 


_ 1 _ 

> 


_ z_ 


with respect 


(The Ohio State University) 


Proof. 


(a) Calculate the null space M ( T ), a basis forAf(T) and nullity of T. 


Letx = 


G M 3 and suppose that we have x G J\f{T ) . 


This means that T (x) = 0 
Thus we have 


Since u 7 ^ 0, we must have 


(—)" 


= 0 . 



'1" 


X 

0 

II 

£ 

X 

II 

1 


y 


.0. 


_z _ 


= x + y . 


This yields that the null space J\f {T ) consists of the vectors of the form 


X 


' 1 ' 


"o' 

—x 

= X 

-1 

+ z 

0 

z 


. 0 . 


.1. 


for any real numbers x, z . 
Thus, we have 


J\f{T) — Span < 



' 1 " 


'o' 

It follows from (*) that the vectors 

-1 

5 

0 


. 0 . 


1 


are linearly independent, hence they form a basis of 


A f{T). 

The nullity of T is the dimension of J\f (T ) by definition, and hence the nullity of T is 2. 


(b) Find the determinant det (A) 

Note that the nullity ofv4 is the same as the nullity of T . So by part (a), the nullity ofv4 is 2. This implies 
that the equation 


Ax — 0 



































has a nonzero solution x. 

Hence the matrix A is singular, and thus the determinant det(v4) = 0 . 


(c) Calculate the range 7Z(T), a basis for 7Z ( T ) and the rank of T . 

3 3 3 

Note that by the formula of the linear transformation T : R -A R , every vector xGl is mapped to a 
scalar multiple of the vector u. 

Also, since u is mapped to u itself, the linear transformation T is nonzero. 

Hence the range is 

7 Z(T) — Span(u), 

and a basis for 'IZ(T) is the set {u} . 

The rank of T is the dimension of the range TZ(T) by definition. 

Thus, the rank of T is 1. 


(d) Calculate the matrix^ representing T with respect to the standard basis for R 3 . 
Let 


'1‘ 


'o' 


'O' 

0 

= 

i 

II 

CO 

CD 

0 

.0. 


.0. 


.1. 


0 

be the standard basis vectors of R . 


We calculate 


r<ei) = (^r) u= i u= 
r<e2) = (^v) u= i u= 
r<e3) = (^v) u= ¥ u = 


1/2 

1/2 

0 

1/2 

1/2 

0 

0 
0 
0 


Thus, the matrix representation A of T is 


A = [r( ei ) r(e 2 ) r(e 3 )] 

' 1/2 1/2 0 " 

= 1/2 1/2 0 . 

0 0 0 



















(e) Calculate the coordinate vector 


Let v = 


and let 


\VB = 


be the coordinate vector of v with respect to the basis B . 
This means that we have 


v = a 

'i 

0 

+ b 

'-1" 

1 

+ C 

‘ o ' 

-l 


0 


0 


l 


To determine a, c, we solve the system 


1 -1 0 

0 1 -1 

0 0 1 



a 


b 


_c _ 


= v. 


The augmented matrix of the system is 


i 

-1 

0 

X 

0 

1 

-1 

y 

0 

0 

1 

z 


Applying the elementary row operations, we obtain 


1 

-l 

0 

X 

R 1 +^2 

1 

0 

-1 

X + y 

0 

l 

-1 

y 


-A 

0 

1 

-1 

y 

0 

0 

1 

z 



0 

0 

1 

z 




Ri 

Rt 

+R 3 
i+R 3 

" 1 

0 

0 

x +y + x 





- * 

0 

1 

0 


y + z 






0 

0 

1 


z 


Hence we have 


a=x+y + z,b=y + z,c = z, 


and the coordinate vector of v with respect to the basis B is 


\VB = 


x+y + z 
y + z 
z 
































^ We fix a nonzero vector a in M 3 and define a map T : M 3 —>• M 3 by 

T(v) = a x v 

for all v £ M 3 . Here the right-hand side is the cross product of a and v. 

(a) Prove that T : M 3 —> M 3 is a linear transformation. 

(b) Determine the eigenvalues and eigenvectors of T. 

Proof. 

(a) Prove that T : M 3 —> M 3 is a linear transformation. 

To prove that T is a linear transformation, we need to verify the following equalities for all 
u, v G R 3 and c G R: 

® T(u + v) = T(u) + T(v) 


^)T(cv) = cT(v). 


These follow from the basic properties of cross products as follows. 
We have 


T(u + v) = a x (u + v) 

= axu + axv the cross product is distributive 
= T(u) + T(v). 


As the cross product is compatible with scalar multiplication, we also have 
T(cv) = a x (cv) = c(a x v) = cT(v). 


Therefore T is a linear transformation. 

(b) Determine the eigenvalues and eigenvectors of T. 

Let A be an eigenvalue of the linear transformation T and let x be an eigenvector corresponding to A. 
Then we have 

T(x) = a x x = Ax. 

We take the dot product and obtain 

(a x x) • x = Ax • x = A||x|| 2 . 


Note that the vector a X x and x is orthogonal, hence the left-hand side of the above equality is zero. 
On the other hand, since x is an eigenvector, it is a nonzero vector. Hence the length ||x|| ^ 0. 

It follows that we have 0 = A||x|| 2 with ||x|| ^ 0, and thus A = 0. 

Hence the eigenvalue of T is A = 0. 


This yields that we have axx = 0- x = 0for any eigenvector x. 

Note that the cross product is zero indicates that the vectors a and x are parallel. 
Hence x = ra for a nonzero r E R. 


In summary, the eigenvalue of T is 0 and corresponding eigenvectors are ra for any nonzero real 
number r £ R. 


Comment. 


Let {ei, e 2 es} be the standard basis for R 3 and write a = 
Then we have 


a i 
a 2 

«3 



a i 


'i 


0 

TM = 

a 2 

X 

0 


as 


_ a 3_ 


.0. 


_—a 2 _ 


a\ 


"o' 


-as 

T(e 2 ) = 

a 2 

X 

1 

= 

0 


_«3_ 


.0. 


_ ai _ 


a\ 


'o' 


a 2 

r(e s ) = 

a 2 

X 

0 

= 

-a\ 


_«3_ 


_1_ 


0 


It follows that the matrix representation of the linear transformation T with respect to the standard 
basis is given by 


A = 


0 

as 

-a 2 


~a 3 

0 

a\ 


a 2 

-ai 

0 


Note that the matrix A is a real skew-symmetric matrix. 

We know that each eigenvalue of a skew-symmetric matrix is either 0 or purely imaginary. 
(See the post ““.) 

Also if the degree of the matrix is odd, it has 0 as an eigenvalue. 


Determine all linear transformations of the 2-dimensional x-y plane R 2 that take the line y — x to the line 

y = -ab¬ 


solution. 

2 2 

Let T : R —> R be a linear transformation that maps the line y — x to the line y = —x . 

Note that the linear transformation T is completely determined if the values of T on basis vectors of the 
2 

vector space R are known. 

Let 

























MGMl]} 


be a basis of R 2 . 


The reason of this choice is as follows. 

Since we know that T takes the line j; = x to the line y — —x , the vector 
point on the line y — —x . 


is mapped into some 


This is how we chose the vector 


, and we just chose the simple vector 


. The other basis vector could be any vector that is not a multiple of 

1' 


0 


Let 


for some a, b E R. 

Since we know that the vector T 


for some c E R. 


( 

' 1 ' 


a 


.0. 

)- 

b_ 


is on the line y — —x , let 


c 

—c 


We now find a formula for this linear transformation T. 


Let 


be an arbitrary vector in the plane R . 

We express this vector as a linear combination of basis vectors: 


LT 


= (x-y) 


+y 


Then we have 


x 

TJ 


= (*-y)T 
= (x-y) a 


1 
0 
1' 
. 0 . 

+y 


+y 


+yr( 


since T is a linear transformation 


c 

—c 


ax + (c — a)y 
bx — (c + b)y _ 
















































We conclude that any linear transformation T : R 2 R 2 that takes the line y — x to the line y — —x 
is of the form 


X 


ax + (c — a)y 

.y. 

)- 

_bx — (c + b)y _ 


for some a, Z?, c £ R. 

Remark. 

Remark that if c — 0, then all the points on the line y — x are mapped into the origin, which is on the line 
y = —x. 

If we want to avoid this degenerate case, we need to assume that c 7 ^ 0. 




Let F : R 2 —>> R 2 be the function that maps each vector in R 2 to its reflection with respect to X-axis. Determine 
the formula for the function F and prove that F is a linear transformation. 


Solution 1. 


2 

be an arbitrary vector in R . 

Its reflection with respect to X-axis is the vector 
Thus the formula for the function F is given by 





2 2 

Next, we prove that the function F is a linear transformation from R to R . 
We need to verify the following two properties: for any u, v £ 1R 2 and c £ R, 


we have 


O F{ u + v) = F( u) + F(v) 
Q F(cu) = cF(u). 


Let u = 


x 


andv = 


ly\ 

Then we have 



















F( u + v) = F 


X + X' 

.y + y 1 . 

X + x f 

~{y + y')_ 


X 

+ 

' x' ‘ 

.-y. 


v-y \ 


= F 


+ F 


X 

Lyj 

= F(u) + F(v). 


X 


l y 


by the formula for F 


by the formula for F 


This prove property (a). 


Next, we have 


F(cu) = F 


cx 


Icy] 


cx 

.-(cy). 


— C 


X 

-y J 

= cF{ u) 


by the formula for F 


by the formula for F. 


Hence property (b) is verified. 

Therefore, the function F is a linear transformation. 
Solution 2. 

We give another proof that F is a linear transformation. 
From the formula that we obtained in Solution 1, we see that 


X 


'i 

0 ■ 


X 

.y. 

)- 

.0 

-1. 


.y. 


It follows that the function F is the matrix multiplication by 


1 0 
0 -1 


As the matrix multiplication function is always a linear transformation, we conclude that F is a linear 
transformation. 


Observe that the matrix 
That is, 


1 

0 


0 

-1 


is the matrix representation of the linear transformation F. 






























1 0 
0 -1 


= [T( ei ),T(e 2 ], 


where ei = 


and e 2 


arc standard basis vectors of R 2 . 


o Let T : M 2 —>■ M 2 be a linear transformation of the 2-dimensional vector space M 2 (the x-y- plane) to itself 
which is the reflection across a line y — mx for some m E M. Then find the matrix representation of the linear 


transformation T with respect to the standard basis B = {ei , 02 } of M 2 , where ei = 

Solution. 

Let A be the matrix representation of T with respect to the standard basis B. 


5 e 2 


Observe that each vector on the line y — mx does not move under the linear transformation T. 

1 

is on the line y — mx, it follows that 


Since the vector 


m 


' 1' 


' 1 ' 

m_ 


m 


Note that if m 7 ^ 0 then the line y — — x is perpendicular to the line y = mx at the origin. 
If m — 0, then the line x — 0 is perpendicular to the line y — 0 at the origin. 

—m 


In either case the vector 


is on the perpendicular line. 


Thus, by the reflection across the line y — mx, this vector is mapped to 
That is, we have 


m 


—m 


m 

1 


.- 1 . 


(*) 




It follows from (*) and (**) that 



■ 1 

—m 



■ 1 ■ 


—m 



' 1 

m 

A 

m 

1 

— 

A 


A 

1 


— 

_m 

-1. 


The determinant of the matrix 


-m 


is 1 + m 2 7 ^ 0, hence it is invertible. 


m 1 

(Note that since column vectors are nonzero orthogonal vectors, we knew it is invertible.) 


The inverse matrix is 


' 1 

—m 

- X _ 1 

1 

m 

m 

1 

1 + m 2 

. ~ m 

1 _ 


Therefore, we have 











































A 


1 

m 



' 1 

—m 


m 

1 


1 m 

m —1 


_1 _ 

1 + m 2 


1 m 

—m 1 


_ 1 _ 

1 + m 2 


1 — m 2 
2m 


2m 

m 2 — 1 


In summary, the matrix representation A of the linear transformation T across the line y = mx 
with respect to the standard basis is 

1 — m 2 2m 
2m m 2 — 1 


A = 


1 + m 2 


Comment. 


The matrix representation A of a linear transformation T : R 2 —» M 2 with respect to the standard 
basis B = {ei,e 2 } ofM 2 is given by 


A = [T(e 1 ) T(e 2 )]. 

So we can technically find the matrix A by finding T(ei) and T(e 2 ), which can be found by 
elementary plane geometry. 


However, it is easier to determine the outputs of T for vectors on the line y — mx and vectors on the 
perpendicular line. 

That why we didn’t compute the vectors T(ei) and T(e 2 ) directly in the above solution. 


^ Let T : M 3 -> 

standard basis is given by the matrix A — 


be a linear transformation and suppose that its matrix representation with respect to the 

1 0 211 


0 3 0 
4 0 5. 

(a) Prove that the linear transformation T sends points on the x-z plane to points on the x-z plane. 

(b) Prove that the restriction of T on the x-z plane is a linear transformation. 

(c) Find the matrix representation of the linear transformation obtained in part (b) with respect to the standard basis 


> of the x-z plane. 


r 

"1' 


"0" 



0 

5 

0 

> 


1 

0 

_1 


.1. 

> 


I 


Proof. 


(a) Prove that the linear transformation T sends points on the x-z plane to points on the x-z 
plane. 


Each point on the x-z plane is of the form 



























for some x, z G R. 
We have 




X 

\ 

X 

T 


0 

= A 

0 



_ 

/ 

_ 


"1 0 2 " 


X 

0 3 0 


0 

4 0 5 


z 


x + 2 z 

0 

4x + 5z 


Since the y-coordinate of the last vector is 0, and thus the output vector lies in the x-z plane, 
(b) Prove that the restriction of T on the x-z plane is a linear transformation. 


o 

Let V be the x-z plane in R . 

Then V is a subspace of the vector space R 3 . 

In part (a), we showed that the restriction of T on V is given by the formula 


/ 

X 

\ 

x + 2 z 

T 

0 

= 

0 

V 

_ 

) 

_ 4 x + 5 z_ 


We abuse the notation and write this restriction as T : V —> V. 
(The precise notation is T \ v : V V.) 

Let 


Xl 


x 2 

0 

5 

0 

.Zl. 


_z 2 _ 


be arbitrary vectors in V and let r E R be an arbitrary real number. 
Then we have 

/ 

V 


Xl 



\ 


~X 1 

0 

+ 

0 






_^ 2 _ 

) 


.21 


(xi + X2 ) + 2 (zi + Z<l) 

0 

_ 4 (xi + x 2 ) + 5 (z! + z 2 ) 


xi + 2 z\ 


x 2 + 2 z 2 

0 

+ 

0 

4xx + 5 Zi_ 


_ 4^2 + 5^2 _ 


(*) 


T 







































and 


( 

Xi 

\ 


rx i 

\ 

r 

0 

= T 


0 


V 

_^1_ 

J 


_rz\ _ 

J 


{rx i) + 2{rz\) 

0 

_4(rxi) + 5(zi) 


= r 


x\ + 2zi 

0 

4^i + 5zi 


= rT 


xi 

0 

L^i J 


It follows that the restriction T : V V is a linear transformation. 


(c) Find the matrix representation of the linear transformation obtained in part (b) with respect 
to the standard basis 


Let B — {u, v} be the standard basis of the x- 

■z plane: 



'1 


'O' 

u = 

0 

,v = 

0 


.0. 


.1 

It follows from formula (*) that we have 




( 

"1" 

\ f 

1' 

T(n) = T 

0 

= 

0 

\ 

.0. 

) [ 

.4. 


= u + 4v 


and 


( 

'O' 

\ 

2" 

T(v) = T 

0 

= 

0 

\ 

_ 1 _ 

! 

.5. 


= 2u + 5v. 


Thus the coordinate vectors with respect to the basis B are 

1 


in u)] B = 


l4 j b 


, [r(v)]s = 


J B 


and the matrix representation of the linear transformation T : V V with respect to the standard 
basis B — {u, v} is 


[[T(u)] B ,[r(v)]B] = 


1 2 
4 5 































^ Let T: 


—y 


1 be the linear transformation defined by the formula 


( 


\ 

Xi + 3x 2 - 2x 3 

T 

X2 

= 

2xi + 3^2 

V 


/ 

x 2 +x 3 


Determine whether T is an isomorphism and if so find the formula for the inverse linear transformation T 1 


Solution. 

Let 5 = {ei,e 2 ,e 3 } be the standard basis of R 3 , where 


'1' 


"o' 


"o' 

0 

5^2 = 

1 

II 

CO 

0 

.0. 


.0. 


_1_ 


We determine the matrix representation [T]b of T with respect to the basis B . 
Since we have 



1 


3 


'-2' 

fiei) = 

1 

o to 

1_ 

>r(e 2 ) = 

3 

_1_ 

CO 

0 

. 1 . 


we have 


[r] s =[r( ei ) r(e 2 ) r(e 3 )] 


13-2 
2 3 0 

Oil 


This matrix is invertible and the inverse matrix is given by 


[T] 


-1 

B 


3 

-2 

-2 


-1 -6 

1 4 

1 3 


(See the post Find the Inverse Matrices if Matrices are Invertible by Elementary Row Operations for details 
of how to find the inverse matrix of this matrix.) 


This implies that the matrix T is an isomorphism. 


Observe that we have [T\ B l = [T 1 ]b • 
Thus, we obtain 























1 

"xi" 

\ 

"xi" 

T -i 

*2 

= [T~ 1 ]b 

*2 

\ 

_*3_ 

J 

_*3_ 


3 -1 -6' 


Xi 

-2 14 


X 2 

-2 1 3 . 


_*3_ 


3 xi — X 2 — 6x3 

— 2xi+X2+4x3 

— 2xi H - X 2 ~\~ 3 x 3 _ 


In summary, the formula for the inverse linear transformation T 1 is given by 


Xi 

\ 

1 

1 

X 

to 

1 

CO 

*2 

= 

—2xi + X 2 +4x3 

_*3_ 

/ 

_ —2xi + X 2 + 3X3 _ 


<D Let R n be an inner product space with inner product (x, y) = x T y for x,y G M n . A linear transformation 
T : M n —»> M n is called orthogonal transformation if for all x, y E M n , it satisfies 

(T(x),T( y)) = (x,y). 

Prove that if T : M n -A M n is an orthogonal transformation, then T is an isomorphism. 


Proof 1. 

As T is a linear transformation from M! 7 to itself, it suffices to show that T is an injective linear 
transformation. 

Suppose that T (x) = T (y) for xj GK” . 

We show that x = y . 

We have 

l|x-y|| 2 

= (x-y) T (x-y) 

= (x T -y T )(x-y) 

T T T T 

= X x-x y-y x + y y 

= (x,x) - (x, y) - (y,x) + (y,y) 

= (r(x),r(x)) - (T(x),T(y)) - (T(y),T(x)) + (T(y),T(y)) 
(since T is an orthogonal transformation) 

= (r(x),T(x)) - (T(x),T(x)) - (T(x),T(x)) + {T(x),T(x)) 
(since T(x) = T(y)) 

= 0 . 


It follows that ||x — y || = 0 and hence x = y . 

















This proves that T : W 1 —> WL n is injective. 


As T is an injective linear transformation from the n -dimensional vector space WL n to itself, it is also 
surjective, and thus T is an isomorphism. 

Proof 2. 

Recall that the linear transformation T is injective if and only if the null space J\[(T ) = {0} , that is, 
T (x) = 0 implies that x = 0 . 

We use this fact to show that T is injective. 

Suppose that T (x) = 0 . 

Then we have 

ll x ll 2 = ( x > x ) 

= (r(x), r(x)) as T is orthogonal 
= (0,0) = 0 as T (x) = 0. 

It follows that the length ||x|| = 0 , and hence x = 0 . 

This proves that the null space J\f [T ) = {0} and T is injective. 

As T is an injective linear transformation from M! 1 to itself, it is also surjective, and hence T is an 
isomorphism. 


Linear Transformation Between Vector Spaces 


Definition 


Let V and W be vector spaces over a scalar field K. 

A function T : V —> W is called a linear transformation if T satisfies the following two linearity 
conditions: For any x,y G F and c £ K , we have 

1 . T(x + y)=T(x) + T(y) 

2 . T(cx) = cT(x) 

The nullspace J\f (T) of a linear transformation T :V^W is 

Af(T) = {x £ F | T(x) = 0}. 

The nullity of T is the dimension of J\T(T ). 

Q The range 1Z(T) of a linear transformation T : Tf —> W is 

1Z(T) = {y £ W | y = T(x) for some x £ V}. 

The rank of T is the dimension of 1Z(T ). 


Summary 


Let T : V —> W be a linear transformation. 

T maps the zero vector Oy of V to the zero vector 0\y of W. That is, T(0y) = 0 w- 


Problems 


For an integer n > 0, let P n be the vector space of polynomials of degree at most n. The set 
B = {1, x, x 2 , • • •, x n } is a basis of P n , called the standard basis. Let T : P n —)■ P n +i be the map defined 
by, for f £ P n , T(f) (x) = xf(x). Prove that T is a linear transformation, and find its range and nullspace. 

Solution. 


T is a linear transformation 


We must show that for polynomials /, g £ P n and scalar c £ R, the function T satisfies 
T(/ + < 7 ) = T(/) + T(gf) and T(c/) = cT(/). The first is checked using associativity of 
multiplication: 


T(f + ^)(x) = x(f + ^)(x) = x/(x) + x^(x) = T(f)(x) + T(g){x). 


Similarly, 
















T(cf)(x) = x(cf)(x) = cxf(x) = cT(f)(x). 

The nullspace of T 

The nullspace of T is the set of polynomials f(x ) such that T(/) = 0. That is, xf(x) — 0. But 
this product can only be 0 if one of the terms being multiplied is 0. Because x 0, we must have 
that f(x) — 0. Thus the nullspace is the trivial vector subspace {0}. 

The range ofT 

The range of T is the set of polynomials of the form xf(x) for an arbitrary polynomial / £ P n . We 
can calculate this by calculating T for a basis of P n . Let B — {1, x, x 2 , • • • , x n } be a basis of P n . 

We see that T(x l ) — x l+l for 0 < i < n, and so the image of the basis is the set 

T(B ) = {x, x 2 , x 3 , • • •, x n+l }. 

Then the range of T must be the span of this set. Specifically, 

K(T) = Span(T(S)) = Span(x,a: 2 ,x 3 , • • • ,x n+1 ). 


Let P n be the vector space of polynomials of degree at most n. The set B — { 1 , x, x 2 , • • •, x n } is a basis of P n 
, called the standard basis. Let T : P4 —> P4 be the map defined by, for / G P4, T(/) (x) = f(x)— x— 1. 
Determine if T(x) is a linear transformation. If it is, find the matrix representation of T relative to the standard 
basis of P4. 

Solution 1. 


We claim that T is not a linear transformation. 
To see this, consider j{x) m \ + x. Then 


T(f)(x) = 1 + x-x— 1 = 0. 


On the other hand, 


T(l) + T(x) = (1-jt— 1) + (pc-x- 1) = -x-l. 

Because T( 1 + x) ^ T{\) + T(x), we see that T is not a linear transformation. 

Solution 2. 

Recall the fact that any linear transformation maps the zero vector to the zero vector. 
The zero vector in is the zero polynomial 6(x) = 0. 

Then we have 


T(6)(x) = 6{x) -x-l = - x - 1 7^ 0(x). 

Thus, the map T does not send the zero vector #(x) to the zero vector 6(x). 
Hence, T is not a linear transformation. 


Let P n be the vector space of polynomials of degree at most n. The set B — { 1 , x, x 2 , • • •, x n } is a basis of P n 
, called the standard basis. Let T : P3 —>• P5 be the map defined by, for / £ P3, T(f)(x) — ( x 2 - 2 )f(x). 
Determine if T{x) is a linear transformation. If it is, find the matrix representation of T relative to the standard 
basis of P3 and P5. 


Solution. 

We must check that T satisfies the two axioms for linear transformations. Suppose/,g G P 3 . Then 
T{f+g){x) = (x 2 - 2){f+g){x) = (x 2 -2)/(x) + (x 2 -2)g(x) = T(f){x) + T(g)(x). 
Second, if c £ R then 

T(cf)(x) = (x 2 -2)(c/)(x) = c(x 2 -2)/(x) = cr(/)(x). 

The two axioms are satisfied, and so T is a linear transformation. 


Now we find its matrix representation relative to the standard basis B — { l,x,x 2 ,x 3 } ofP 3 and the 
standard basis C — {l,x,x 2 ,x 3 ,x 4 ,x 5 } .To do this, for every f G B we calculate the coordinate 
vector of T (f) relative to the basis C . 

For example, we find that 7"(1) = x 2 — 2 . The coordinate vector for this element, relative to C, is 


[x 2 -2]c = 


-2 

0 

1 

0 

0 

0 


Similarly, we find that T (x) = x 3 — 2x . The coordinate vector for this element, relative to C , 


[x 3 -2x]c = 


0 

-2 

0 

1 

0 

0 

0 


Performing this process for the rest of the elements of B , we get 


[7V)] C = [x 4 -2x 2 ]c = 


— 1 

0 

_1 


1 

O 

0 


0 

-2 

0 

[r(x 3 )] c = [x 5 -2x 3 ]c = 

0 

-2 

1 


0 

1 - 

O 

1_ 


. 1 _ 


And now putting these four column vectors in order, we get the matrix representation for T : 











re = 


-2 0 

0 -2 

1 0 

0 1 

0 0 

0 0 


0 0 

0 0 

-2 0 

0 -2 

1 0 

0 1 


Let P 3 be the vector space of polynomials of degree 3 or less with real coefficients. 

(a) Prove that the differentiation is a linear transformation. That is, prove that the map T : P 3 —>• P 3 defined by 

nM )=£fw 

for any f(x) E P 3 is a linear transformation. 

(b) Let B = {1, x, x 2 , x 3 } be a basis of P 3 . With respect to the basis B, find the matrix representation of the 
linear transformation T in part (a). 


Proof. 

(a) Prove that the differentiation is a linear transformation. 

Let f{x ), g(x) G P 3 .By the basic properties of differentiations, we have 


T ifix) + s( x )) = C/W + sb)) 

= £c/W)+ £(*<*)) 

= r(/(x)) + r( g (x)). 


For/(x) E P 3 and r E R, we also have 


T (EE)) = ^r(EW) 
= rT (f ( x )) • 


Therefore, the map T is a linear transformation. 


(b) Find the matrix representation of the linear transformation T. 


We compute 


r(i) 

n* 2 ) 




r(x 3 ) = -y-(x 3 ) — 3 x 2 . 

ax 


It follows that the coordinate vectors are 





[T(1)]b = 


[T(x 2 )]b = 


0 

0 

0 

0 

0 

2 

0 

0 


[T(x)]b 


1 

0 

0 

0 


[T(x 3 )]b = 


0 

0 

3 

0 


Thus the matrix representation of the linear transformation T is given by 


[[r(i)] s ,[r(x)] s ,[r(x 2 )] s ,[r(x 3 )] s ] = 


0 10 0 
0 0 2 0 
0 0 0 3 
0 0 0 0 


Remark that we also write this as 

[7(1) T(x) T(x 2 ) 7(x 3 )] 

0 1 0 O' 

M 2 31 0 0 2 0 

= 1 X X 2 x d 

0 0 0 3 

^0 0 0 0 . 

Related Question. 

Letv4 be the matrix representation obtained in part (b). 

It is easy to see that A 4 is the zero matrix. 

We say that A is a nilpotent matrix. 

There is a theoretical explanation behind this. 

Check out the post “Differentiating Linear Transformation is Nilpotent” for an explanation. 


Let P 2 (M) be the vector space over R consisting of all polynomials with real coefficients of degree 2 or less. Let 
B — {l,£,£ 2 }bea basis of the vector space P 2 (R). For each linear transformation T : P 2 (R) —> P 2 (R) 
defined below, find the matrix representation of T with respect to the basis B. For f(x) £ P 2 (R), define T as 
follows. 

(a) 


n/w) 


(b) 


T (f( x )) - 


(t - x) 2 f(t) d t 


(c) 


T(f(x))=e x —(e~ x f(x)) 

ax 














Hint. 


To find the matrix, we just need to compute 'T(l), T{x), T{x 2 ) and read the coefficients of 1, x, x‘ 


Solution. 


(a)T(f(x)) = -£tf(x) - 3 j-f{x) 


dx 2J v J ~ dx J 
Since the derivative the constant function 1 is zero, we have 

r(i) = o. 

Since the second derivative of the function x is zero, we have 

T(x) = -3. 

Finally, we calculate 

T(x 2 ) = 2 - 6x. 


IS 


There with respect to the basis {l,x,x 2 } , the matrix for T 

0-3 2 

0 0-6 

_0 0 0 . 

(b) T{f{x)) = f\{t - x) 2 f(t) d t 
We first compute 


T(f{x)) — J {t 2 — 2tx — \-x 2 )f{t)dt 

= (jfM dt) - 2 tf{t) d t 1 x 


/> 


dt X 


Then we have 


Here we calculated the integrals 


r(i) = - + 2x 2 
4 

Tlx) = — —x 
x 

2^ 2 2 2 
TV) =- + 


J ldt=[t] 1 _ 1 =2, j t d* = ^t 2 j =0, 

/>= [TL=!■ Sf i,= [l* 4 ]!. =0 ' ff d,= [1 


-1 


Therefore, the matrix for the linear transformation with respect to the basis B — {1, x, x 2 } i 


is 


Cd | lO 










(c)T(f(x)) = e x ±{e X f(x)) 

Note that by the product rule for derivatives, 


r (/!-'-)) = e* f-e ‘fix) + e ' j 
= ~Ax) + £■/«. 


Thus we have 


r(i) = -l 
T (x) = — x + 1 
T{x 2 ) = — x 2 + 2x. 


Hence the matrix for the linear transformation T with respect to the basis B is 

'-1 1 0 ‘ 

0-12. 

.0 0 - 1 . 

Let U and V be vector spaces over a scalar field F. Define the map T : U — > V by T(u) = Oy for each vector 
u eU. 

(a) Prove that T : U — > V is a linear transformation. (Hence, T is called the zero transformation.) 

(b) Determine the null space Af(T) and the range 1Z(T) of T. 


Proof. 

(a) Prove that T : U V is a linear transformation. 

Let Ui, U 2 £ U and r be a scalar, that is, r £ F. 

It follows from the definition of T that 

r(ui) = Ok, T{y 2 ) = Ok, 
r(m + u 2 ) = Ok, %i) = Ok 


since Ui + U 2 ,nii £ U. 

Hence we have 

^(Ui +U 2 ) = 0 y = 0y + 0 V = T(ui) + J(u 2 ) 
T(rui) = 0y = rOy = rT(u\). 







Since these equalities holds for all Ui, 112 G U, and r G F, the map T : U V is a linear 
transformation. 

(b) Determine the null space J\f ( T ) and the range 1Z(T) of T. 

The null space J\f(T) of T is, by definition, 

Af(T) = {u G U | T(u) = 0 V }. 


Since T (u) — Oy for every uG U, we obtain 

N(T) = C/. 

The range 1Z(T) of T is, by definition, 

7£(r) = {v G F | there exists uGf such that 7(u) = v}. 

Since every vector of U is mapped into 0 y , we have 

n(T) = {o F }. 


^ Let Q denote the set of rational numbers (i.e., fractions of integers). Let V denote the set of the form x + y\/ 2 
where x,y £ Q. You may take for granted that the set If is a vector space over the field Q. 

(a) Show that B — {1, y/2] is a basis for the vector space V over Q. 

(b) Let a = a + b\J 2 G V , and let T a : V — > V be the map defined by 


T a (x + yy/2) := (ax + 2by) + (ay + bx)^/2 £ V 
for any x + y^/2 £ V. Show that T a is a linear transformation. 


(c) Let 


ly] B 


— x + Find the matrix Tg such that 


+ 2/\/2) — ( Tb 


X 

ly}/B 


and compute det Tb- 
(The Ohio State University) 


Hint. 

For (a), to show that B is linearly independent, consider a linear combination 

ci • 1 + 6*2 \/2 — 0 , 

where Ci , C2 G Q . Use the fact that \f2 is a irrational number to conclude Ci = C2 = 0. 


^ You may directly check the linearity, or note that 

T a (x + yV 2) = a(x +jy / 2). 







Proof. 

(a) Show that B = {1, y/2 } is a basis for the vector space V over Q. 

Since V — {x • 1 + yV 2 | x,y G Q} = Spanjx, y} , we see that B — {1, \/2} spans the vector 
space V. 

To show that B is a linearly independent set, consider a linear combination 

C\ • 1 + C'2 \/2 — 0, 


where c \, C 2 G Q. 


We want to show that c\ — C 2 — 0. Assume that this is not the case. Then both c\ and C 2 must be 
nonzero. 

Then we have 




ci_ 
C2 ’ 


(Note that c 2 7 ^ 0.) 

Since C\ , C2 G Q, the fraction is also a rational number. However y/2 is irrational number. 

Hence this is a contradiction and we must have Ci — C 2 — 0. Thus B is a linearly independent set, and B 
is a basis. 

(b) Show that T a is a linear transformaiton. 

First note that we can simplify the given formula as 

Ta( x +yV 2 ) = (a + by/2)(x + yy/2) = a(x + Ta/ 2 )- 

Namely, T a is a multiplication by a. 

Letxi +y\y/2,X2 + T 2 a /2 G V, where Xi,ji,x 2 ,T 2 E Q. 

Then we have 

T a ((*1 +Tia/2) + ( x 2 +T2a/ 2)) — «((^i +Tia/2) + ( x 2 +T2a/2)) 

= a(x 1 + «(^2 +T 2 a/ 2 ) 

= +Tia/2) + 'Ta( x 2 +T2A/2)- 

For x + Ta/ 2 G F and c G Q, we also have 

T a (c(x + yy/2)) = a (c(x + yy/2)) 

= ca(x +yy / 2) = cT a (x+yy / 2). 


Therefore T a is a linear transformation. 

(c) Find the matrix Tb and compute det Tb . 



To find the matrix representation for the linear transformation T a with respect to the basis B , we compute 
T a (1) and T a (\/2)- 

We have 


T a ( 1) = a • 1 = a + by /2 = 


b 


T a (y/2) — a • y/2 = 2b + ay/2 = 


26 

a 


J B 


Thus the matrix is 


The determinant of this matrix is 


T b 


a 2b 
b a 


det Tb — a 2 — 2b 2 . 

Comment. 

In a typical first year linear algebra course, we use real numbers R or complex numbers C as a base field. 
Or just state theorems using a field K , for a general field K but most of the examples use R or C. 


In this sense this problem is unorthodox because the field is rational numbers Q. 
But the problem is quite typical in Field theory or in Number theory. 


Let V be the vector space of all 2 X 2 real matrices and let P 3 be the vector space of all polynomials of degree 3 
or less with real coefficients. Let T : P 3 —>• V be the linear transformation defined by 


T(a 0 + a\X + a 2 x 2 + a 3 x 3 ) = 


a 0 a 2 ~ a 0 a 3 

_a\ — a 2 —cl\ — CI3 _ 


for any polynomial «o + Cb\X + a 2 x 2 + CI 3 E P 3 . Find a basis for the range of T, 7 Z(T), and determine the 
rank of T, rank(T) , and the nullity of T, null(T) . Also, prove that T is not injective. 

Proof. 

Let {1, x, x 2 , x 3 } be a basis of the vector space P 3 . 

Then the range of T is spanned by the images of these basis vectors. That is, 

71(7) = Span ! 7(l), T(x), T(x 2 ), 7(x 3 )}. 

Thus, the range 1Z(T) is spanned by the following matrices 


m = 


1 -1 1 
0 0 


T(x) = 


ro 0 

1 -1 


T(x 2 ) = 


1 0 
-1 0 


T(x 3 ) = 


0 1 
0 -1 


We find a basis among these matrices. 


















Let B = {E n , E 12 , E 2l , E 22 } be the standard basis of V, where the (z,y)-entry of E -is 1 and the other 
entries are 0. 

With respect to this basis B , the coordinate vectors of the above matrices are 



■ 1 ■ 


■ o ■ 


-1 


0 

[TWls = 

0 

, [T(x)] b = 

1 


0 

■ ■ 


-1 

■ ■ 


[T(x 2 )] b 


■ 1 1 


1 0 1 

0 

, [T(x 3 )] b = 

1 

-1 

0 

0 

1 1 


-1 

1 1 


Let A be the matrix whose columns are these vectors: 

1 1 0 1 0 1 

-10 0 1 

A= 0 1-10 

o-io-i 

I ■ 

Applying the elementary row operations, we obtain 



■ 1 


0 

1 

0 1 

r 2 + r x 

1 1 

0 

1 

0 


-1 


0 

0 

1 

r a + r 3 

0 

0 

1 

1 

— 

0 


1 

-1 

0 


0 

1 

-1 

0 


0 


-1 

0 

-1 


0 

0 

-1 

-1 


■ 




1 


■ 






*1 

0 

1 

0 1 

R1-R3 
R 2 + R 2 

■1 

0 

0 

-1 1 


•*3 

0 

1 

-1 

0 

r 4 + r 3 

0 

1 

0 

1 


0 

0 

1 

1 


0 

0 

1 

1 



0 

0 

-1 

-1 


0 

0 

0 

0 


The last matrix is in reduced row echelon form and the first three columns contains the leading 1 ’s. 
Thus, the first three matrices T( 1), T(x), T(pc 2 ) form a basis for the range 1Z(T). 

(This is the “leading 1 method”.) 

(Note also that the last column gives the coefficients of the linear combination 
T(x 3 ) = - HI) + T(x) + T(x 2 ).) 


It follows that the rank, which is the dimension of the range, is 3. 
By the rank-nullity theorem we have 




















rank(T) + null(T) = dim( J P 3 ) = 4. 


We deduce that the nullity is 1. 

Finally, since the nullity of Tis 1, the linear transformation T is not injective. 
(Recall that T is injective if and only if the nullity is 0.) 

Another way to see that T is not injective is as follows. 

We saw that F(x 3 ) ff - T(l) + T(x) + T(x 2 ). This implies by linearity that 

T(x 3 ) = T(- 1 + x + x 2 ). 

Thus the values of T at two distinct polynomials x 3 and -1 + x + x 2 are the same. 
Hence T is not injective. 


In conclusion, 


Wl), T(x), T(x 2 )} 


"1 "I 
0 0 


■0 0 
1 -1 


■ 1 

-1 


0 ■ 

0 ’ 


is a basis for the range of T. We have 

rank(7) = 3 and null(7) = 1. 


^ Let V be a real vector space of all real sequences 

( a 4“l = (°1> a 2, • • •)• 


Let U be the subspace of V consisting of all real sequences that satisfy the linear recurrence relation 
a k+2 ~ 5a& + i + 3dk = 0 for k = 1,2,.... 

(a) Let 


Ul = (1,0,-3,-15,-66,...) 
u 2 = (0,1,5,22,95,...) 


be vectors in U . Prove that {ui, U 2 } is a basis of U and conclude that the dimension of U is 2. 

(b) Let T be a map from U to U defined by 

^((oi.oa, •••)) = (02,03,.. •)• 

Verify that the map T actually sends a vector E V to a vector in U, and show that T is a 

linear transformation from U to U . 

(c) With respect to the basis {ui, U 2 } obtained in (a), find the matrix representation A of the linear 
transformation T : U —>• U from (b). 


Proof. 

See the post Sequences satisfying linear recurrence relation form a subspace for a proof that U is a 
subspace of V . 

(a) Prove that { 111 , 112 } is a basis of U and the dimension of U is 2. 










Let 


(a«)“ 1 = (ai,a 2 ,as ,...) 

be an arbitrary vector in U. 

If we know the first two terms a \, a 2 , the remaining terms are determined by the linear recurrence relation 
dk -\-2 — 5a^ + i + 3ak = 0. Thus if the first two terms of two sequences of U are equal, then the 
sequences are equal. 

From this observation, we have 


= 01 U 1 + a 2 n 2 . 

Hence {lli, u 2 } is a spanning set of U. 


It remains to show that {lli, u 2 } is a linearly independent set. 
Suppose that 


ci ui + c 2 vl 2 = 0, 


where 0 is the zero sequence. 

Then since we have 

( 0 , 0 , 0 ...) 

= C 1 U 1 + c 2 u 2 

= Cl (l,0,-3,...)+c 2 (0,l,5,...) 
(ci 5 c 2 , 3ci + 5 c 2 ,...), 


we must have Ci — c 2 — 0. Hence {ui, u 2 } is a linearly independent set, and it is a basis. Therefore the 
dimension of U is 2. 

(b) T is a linear transformation from U to U. 

Let us write T ((a z - )^ 1 ) = (6/ )?? 1 . Thus = a^i by definition of T . Then we have 

bk+ 2 — 5bk +1 + 3 bk — dk+z — 5dk+ 2 + 3ak+i — 0 


for k — 1,2,... since (a z - )^ 1 G U, hence satisfies the linear recurrence relation. 

Thus the sequence ( 6 / )^ 1 also satisfies the recurrence relation, and it is in U . Thus T sends a vector in U 
to a vector in U. 


Let us prove that T is a linear transformation from U to U. 

Let {cii , (bi )^ 1 be vectors in U and let r G M be a scalar. 
Then we have 


T({ ai )? =1 +{b i )? =1 ) 

= T(( ai +b i )™ 1 ) 

— (a 2 + b 2 ,a 3 + b 3 ,...) 

= (a 2 ,a 3 ,...) + {b2,b 3 ,...) 

= T((°i)£i) + T ((*)&) 

Also we have 

= T ({cai)™ i) 

= (ca 2 ,ca 3 ,...) 

= c(a 2 ,a 3 ,...) 

= cT (Mm) 

Therefore T is a linear transformation. 

(c) The matrix representation^ of the linear transformation T 

We compute the values of the linear transformation of T on basis vectors Ui, 112 and express them as a 
linear combination of lli, 112 . 

We have 

r( Ul ) = r((i,o,-3,-i5,-66,...)) 

— (0, —3, —15, —66,...) 

= -3(0,1,5,22,...) 

= Oui — 3u 2 . 


and also we have 


r(u 2 ) = r((o, 1,5,22,95,...)) 

= (1,5,22,95,...) 

= (1,0,-3,-15,...)+ (0,5,25,110,...) 

= Ui + 5ll2« 

Therefore the matrix representation of T with respect to the basis B — {m, 112 } is given by 


A = [[r(ui)] s ,[r(u 2 )] s ] = 


Here[r(ui)]g = 

[r(u2)]i]= \] 



is the coordinate vector of 7"(ui) with respect to the basis B . Similarly for 


Remark. Using the matrix A for the linear transformation T , we can write formally 

[r(ui) r(u 2 )] = [ui u 2 ] 0 1 

—3 5 


0 Let V be the vector space of all n X n real matrices. Let us fix a matrix A £ V. Define a map T : V —> V by 
T(X) = AX- XA for each iGf. 

(a) Prove that T : V —> V is a linear transformation. 

(b) Let B be a basis of V. Let P be the matrix representation of T with respect to B. Find the determinant of P. 










Proof. 

(a) Prove that T : V —> V is a linear transformation. 

To prove T is a linear transformation, we need to show the following properties. 
$) For anyX, Y E V , we have T (X + Y) = T(X) + T{Y) . 


o For any X E V , r E R , we have T (rX) — rT (X) . 


To check condition 1, let X,Y E V . Then we have 

T(X+ Y) =A(X+ Y) - (X + Y)A 
— AX + AY -XA-YA 
= AX - XA + AY - YA 
= T(X) + T{Y) 


by definition of T 


by definition of T. 


Hence condition 1 is met. 

To verify condition 2, let X E V, r E R . 

Then we have 

by definition of T 
r is a scalar 

by definition of T. 


T(rX) - A(rX) - (rX)A 
= rAX - rXA 
= r(AX - XA) 

= rT{X) 


So condition 2 is also met, hence T is a linear transformation. 

(b) Find the determinant of the matrix representation of T. 

Let B be a basis of the vector space V and let P be the matrix of the linear transformation T with respect 
to B . We prove that the determinant of P is zero. 

Let / be the n X n identity matrix. Then we have 

T(I) = AI-IA=A-A = 0 , 


where O is the n X n zero matrix. 

Since T(O) — O , this implies that the linear transformation T is not injective, hence P is a singular 
matrix. 




Let us explain the details. 

2 

Let v = [I\g G R” be the coordinate vector of / with respect to the basis B . 

Then since Ip O , the vector v is not zero. 

Then T{I) — O implies that 

p v = oeR n \ 

As the nonzero vector v is a solution of the matrix equation Px — 0 , the matrix P is singular. 
Since P is singular, the determinant of P is zero. 


The space C°° (R) is the vector space of real functions which are infinitely differentiable. Let 
T : C°° (R) —>• P3 be the map which takes / E C°° (R) to its third order Taylor polynomial, specifically 
defined by 

T(f)(x) = m + f'(0)x + Pp-x 2 + Pp-x 3 . 

Here, f , f" and f" denote the first, second, and third derivatives of /, respectively. Prove that T is a linear 
transformation. 


Proof. 


To prove that T is a linear transformation, we must check that it satisfies the two axioms for linear 
transformations. First, suppose/ g E C^R). Then basic properties of the derivative tell us that 

(f+g)\x) =f(x) + g'(x) 

(f+g)"(x) =f(x) + g"(x) 
if + g)"(x) =/"(x) + g"'(x). 

And so we can see 


T(f+ g)(x) 

= (f+g)(0) + (f+g)'(0)x + 


<f+g){0 ) , (f+g) (0) 


-x 2 + 


2 6 
/(0) , g"(0) , /'(0) 


= m +g(0) +/ (0)x+g ( o) X +—x z + 2 


x 2 + x 3 + 


,A°) ,. /"( 0 ) 
2 

= T(1)(x) + T(g)(x). 


g (0) 
6 6 

g '(0) , g"'(0) 


= ./(0) +/ (0)x + — x z + —X* + g(0) + g (0)x + ^-X z + 


Now for a scalar c € R, the equality ( cf) (x) = cf (x) is a basic property of the derivative. Then, 


T(cf)(x) = cm + (cf) (0)x + 


(cf) (0) (cf) (0) 

- X Z H- } 

2 6 

/( 0 ) , /'( 0 ) , 


= c/(0) + cf (0)x + c . x~ + c 


/ , /( 0 ) , /"( 0 ) ^ 

= c|./(0) +/ (0)x + + “g - • 

= cT(f)(x) 



















Thus, T is a linear transformation. 


<D Let P n be the vector space of all polynomials with real coefficients of degree n or less. Consider the 
differentiation linear transformation T : P n — >• P n defined by 

(a) Consider the case n — 2 . Let B — {1, cc, a? 2 } be a basis of P 2 - Find the matrix representation A of the linear 
transformation T with respect to the basis B. 

(b) Compute A 3 , where A is the matrix obtained in part (a). 

(c) If you computed A 3 in part (b) directly, then is there any theoretical explanation of your result? 

(d) Now we consider the general case. Let B be any basis of the vector space of P n and let A be the matrix 
representation of the linear transformation T with respect to the basis B. Prove that without any calculation that 
the matrix A is nilpotent. 


Background. 

Differentiating is a linear transformation. 


Recall that differentiating a polynomial is a linear transformation. 

See the post “Differentiation is a linear transformation” for a proof. 

The post also contains a similar problem for n — 3 . 

Definition (Nilpotent matrix). 

A square matrix A is called nilpotent if there exists a positive integer k such that A k — O, the zero 
matrix. 


Solution. 

(a) Find the matrix representation^ of the linear transformation T with respect to the basis B . 

To find the matrix representation^ of T , we first find the coordinate vectors as follows. 

Since we have 


T( 1) = 0 = 0-l + 0-x + 0-x 2 
T(x) = l = l-l + 0-x + 0-x 2 
T (x ) — 2 x = 0* 1 H - 2 • x H - 0 • x 

we have the coordinate vectors of these with respect to the basis B is 


[T(1)]b = 

'o o 

, TMls = 

'i 

0 

to 

II 

'O' 

2 


0 


0 


0 









Hence the matrix representation A of T is 

A = [ [T( 1)] B [T(x)] b [T(x 2 )] b 


0 1 0 
0 0 2 
0 0 0 


(b) Compute A 3 , where A is the matrix obtained in part (a). 
We calculate 



'0 

1 

o' 


'0 

1 


O' 


'0 

0 

2' 



to 

0 

0 

2 


0 

0 

2 


0 

0 

0 

and , 


.0 

0 

0 . 


.0 

0 

0 . 


.0 

0 

0 . 






'0 

0 

2' 


'0 

1 

o' 


'0 

0 

o' 

N 

CO 

to 

= 

0 

0 

0 


0 

0 

2 

= 

0 

0 

0 




0 

0 

0 


0 

0 

0 


0 

0 

0 


Thus, A 3 is the zero matrix. 


(c) Is there any theoretical explanation of your result? 

Recall from calculus that differentiating a polynomial once lowers the degree by one. Thus if we 
differentiate a degree 2 polynomial once, then you get a degree 1 polynomial. If we apply the 
differentiation again, we obtain a degree 0 polynomial, which is a constant. If we differentiate once more, 
it becomes the zero polynomial. 

Hence differentiating a polynomial of degree 2 or less three times, the polynomial becomes the zero 
polynomial. 

Differentiating once is the linear transformation T , hence multiplying by A corresponds to differentiating 
once. 

o 

Therefore, A corresponds to differentiating three times. 

This is why A 3 is the zero matirx. 


(d) Prove that without any calculation that the matrix A is nilpotent. 

We generalize the argument in part (c) to any positive integer n . Recall again differentiating a polynomial 
once lowers the degree by one. 

Thus, if we differentiate a degree n polynomial n + 1 times, then the polynomial becomes the zero 
polynomial. 

In other words, we have 
















T n+1 (f{x)) = 0 


for any/(x) G P n • 

Thus we have T n+l is the zero linear transformation. 

In terms of the matrix representation, this means that we have A n+l = O, the zero matrix. 


G Let W = C°° (M) be the vector space of all C°° real-valued functions (smooth function, differentiable for all 
degrees of differentiation). Let V be the vector space of all linear transformations from W to W. The addition and 
the scalar multiplication of V are given by those of linear transformations. Let T\ , , T 3 be the elements in V 

defined by 


T i(f(x)) = ±f(x) 

T 2( f ( x )) = ^f( x ) 

T 3 ( f(x)) = [ f(t) d t. 
Jo 


Then determine whether the set {T\ , T 2 , T 3 } are linearly independent or linearly dependent. 


Proof. 

We prove that the set {T\ , T2 , T3} are linearly independent. 

Suppose that we have a linear combination 

C1T1 +c 2 T 2 +c 3 r 3 = 0 , (*) 

for some real numbers c \, C 2 , C 3 . 

Note that 0 is the zero linear transformation 0(/(x)) = 0 for all x, and it is the zero vector in the vector 
space V. 

The equality (*) is an equality as linear transformations. 

We need to show that Ci = C 2 = C 3 = 0. 


Let us consider the function f(x) = 1 in C 00 (R). 
Then we have 


Ti (1) = 0 
T 2 ( 1 ) = 0 


T 3 



x. 


These yield that 


0 — ci7i(l) + c 2 T 2 (1) + c 3 T%{1) — C 3 X, 


and hence C 3 = 0. 



Next, consider the function f{x) — x . 
We have 


T\ (jc) = 1 
T 2 (x) = 0 


and 


0 = ciTiix) + c 2 T 2 {x ) = ci. 


Thus Ci = 0. 


Finally, consider the function f(x) — x 2 . 
We compute 


r 2 (x 2 ) = 2, 


and the equality (*) together with Ci = C 3 = 0 gives 

0 = C 2 T 2 (x 2 ) = 2 c 2 . 


So C 2 =0, and we have proved that c\ — c 2 — C 3 = 0. 

Thus we conclude that the set {7i, T 2 , is linearly independent. 


0 Let V denote the vector space of all real 2 
given as below. 


X 2 matrices. Suppose that the linear transformation from V to V is 


T{A) 


2 

5 


A-A 


3 

7 


Prove or disprove that the linear transformation T : V —> V is an isomorphism. 


Hint. 

A linear transformation T is an isomorphism if it is both surjective (onto) and injective (one to one). 
Recall that a linear transformation T is injective if and only if the kernel is trivial, that is, ker(r) = {0} . 

Proof. 

We claim that T is not an isomorphism. 

Recall that an isomorphism means that T is surjective (onto) and injective (one to one). 

Thus, to disprove that T is a linear transformation, it suffices to prove that T is not surjective or T is not 
injective. 







Let us prove that T is not injective. 

Equivalently, we show that the kernel ker(T) is nontrivial, that is, ker(T) 7 ^ {0} , where O is the 
2x2 zero matrix. 


If A is in ker (T) , then we have O = T(A) and it follows that 


2 

5 




3 

7 


In other words, a matrix is in the kernel if it commutes with the matrix 


2 

5 


3 

7 


So out goal is to find nonzero matrices that commute with 


2 

5 


3 

7 


For example, the 2 X 2 identity matrix / will do. 

Hence the kernel of T contains a nonzero matrix, hence T is not injective. 
Thus, T is not isomorphism. 


Comments 


Note that we did not have to determine the kernel. 
What we needed is to show that ker(T) 7 ^ {0}. 
We proved this by showing that / £ ker(T). 


Another matrix in the kernel is 


2 

5 


3 

7 


itself. 


Also, note that we proved that T is not an isomorphism because T is not injective. 
We could have tried to prove that T is not surjective but this is harder. 




Let U and V be vector spaces over a scalar field F. Let T : U —>• V be a linear transformation. Prove that T is 
injective (one-to-one) if and only if the nullity of T is zero. 


Definition (Injective, One-to-One Linear Transformation). 

A linear transformation is said to be injective or one-to-one if provided that for all Ui and Ui in U, 
whenever T (u 1 ) = T (U 2 ), then we have u 1 = U 2 . 

Proof. 

): IfT is injective, then the nullity is zero. 

Suppose that T is injective. 

Our objective is to show that the null space J\f(T ) = {0 jj} • 

Since T is a linear transformation, it sends the zero vector 0 jj of U to the zero vector 0 y of V . 

In fact, we have 















T(Ou) = T{Ou- Ou) 

= T(O u + (-l)O u ) 

= T(Ou) + (—l)r(Oc/) by linearity of T 
= T(0u)-T( 0u) = 0 v . 


Hence Of/ E Af(T). 

On the other hand, if u E AT {T ), then we have 

T (u) = 0 v = T(0u). 

Since T is injective, it yields that u = 0 jj . 

Therefore we obtain Af{T ) = {Of/}, and the nullity of T is zero. 

(Recall that the nullity of T is the dimension of Af{T ) .) 

( *<= ): If the nullity is zero, then T is injective. 

Next, suppose that the nullity of T is zero. 

This is equivalent to the condition Af(T) — {Of/}. 

Our goal is to show that T : U V is injective. 

Suppose that r(ui) = ^( u 2 ) for some u 1,112 E U. 

Then we have 

o v = r(m) - r(u 2 ) 

= r( Ul ) + (-i)r(u 2 ) 

= 7"(ui + ( — 1 ) 112 ) by linearity of T 

= ^(ui - u 2 ) 


It follows that the vector m — u 2 is in the null space Af(T ) = {Of/}. 
Thus, we have Ui — u 2 = Of/, or m = u 2 . 

So the linear transformation T is injective. 

Related Question. 

Problem. 

Let U and V be finite dimensional vector spaces over a scalar field F. 
Consider a linear transformation T : U V . 

Prove that if dim (U) > dim(R), then T cannot be injective. 


The proof of this problem is given in the post 1 

A Linear Transformation T : U -A V cannot be Injective if dim (U) > dim (V) 


^ Let U and V be finite dimensional vector spaces over a scalar field F. Consider a linear transformation 
T : U —> V. Prove that if dim (JJ) > dim(R), then T cannot be injective (one-to-one). 


Hints. 

You may use the folowing facts. 

A linear transformation T : U —> V is injective if and only if the nullity of T is zero. 

For the proof of this fact, see the post i 

A Linear Transformation is Injective (One-To-One) if and only if the Nullity is Zero 

Rank-Nullity Theorem 

For a linear transformation T : U -A V , we have 

null(r) + rank(r) = dim((7). 


We give two proofs. The first one uses the rank-nullity theorem. 

The second one avoids using the theorem. 

Proof 1. 

By the rank-nullity theorem, we have 

null(r) + rank(r) = dim((7). 

Note that the rank of T is the dimension of the range TZ(T) , which is a subspace of the vector space of V. 
Hence it yields that 

rank(r) = dim(TZ(T)) < dim(F). 


It follows that 


null(r) = dim((7) — rank(7) > dim((7) — dim(F) > 0, 


where the last inequality follows from the assumption. 

This implies that the nullity is nonzero, hence T is not injective. 

(See the hints above.) 

Proof 2. 

In the second proof, we prove T is injective without using the rank-nullity theorem. 

Seeking a contradiction, assume that T : U -A V is injective. 

Let {ui 5 be a basis of the vector space U , where n — dim((7). 

Put v i = T (u!) = r(u„). 



We claim that the vectors v i,. . ., v n are linearly independent in V . 
To see this, consider the linear combination 

C 1 V 1 H-1- c„v„ = Of, 


where c \,..., c n are scalars in F. 

Then we have 

Of = civi H-1- c„v„ 

= ciT(ui) H- V c n T (u„) 

= r(ciui H-b c n u„). 


Since T is assumed to be injective, we must have 

ClUl H-h c n u n = Of/. 

Because Ui,..., u w are basis vectors, they are linearly independent. 
It follows that 

Ci — • • • — c n — 0, 

and hence the vectors Vi,. . ., v n are linearly independent. 


As the vector space V contains n linearly independent vectors, we see that 

dim(F) > n — dim(t/), 

which contradicts the assumption that dim([/) > dim(F). 

Therefore, T cannot be injective. 


o Let V be a vector space over the field of real numbers M. Prove that if the dimension of V is n, then V is 
isomorphic to M n . 

Proof. 

Since V is an ^-dimensional vector space, it has a basis 

{Vp ...,v„}, 


where each is a vector in V. 


Define a map T\ V —> R n by sending each vector v e fto its coordinate vector [v] 5 with respect to 
the basis B. 

More explicitly, if 

v = CjVj + - + C n v n with C X ,.... c„ e R, 
then the coordinate vector with respect to B is 


Mg = 


€ R”. 


Then the map T: V —► R n is defined by 


m 


It follows from the properties of the coordinate vectors that the map T is a linear transformation. 

We show that T is bijective, hence an isomorphism. 

T is injective. 

To show that T is injective, it suffices to show that the null space of T is trivial: Af{T) = { 0 }. 

(See the post “A Linear Transformation is Injective (One-To-One) if and only if the Nullity is Zero” 
for a proof of this fact.) 

If v 6 Af(T), then we have 

o - m = [v] 5 . 

So the coordinate vector of v is zero, hence we have 

v = Ovj + —t- 0v^ = 0. 

Thus, Af(T) = { 0 }, and T is injective. 

T is suijective. 

To show that T is surjective, let 


a = 


be an arbitrary vector in R n . 
Then consider the vector 


v:=fl 1 v 1 + 


+ a n y n 








in V. 

Then it follows from the definition of the linear transformation T that 


m m Mb = 


= a. 


Therefore T is surjective. 


In summary, T: V —► R^ is a bijective linear transformation, and hence T is an isomorphism. 
Thus, we conclude that V and are isomorphic. 

Generalization 

We may use more general field F instead of the field of real numbers R. 

Then the general statement is as follows. 


If V is an ^-dimensional vector space over a field F, then V is isomorphic to F w . 


The proof is identical as above except that we replace R by F everywhere. 




Orthogonal Bases 


Definition 


Let V be a subspace in M n . 

If a basis B for V is an orthogonal set, then B is called an orthogonal basis. 

If a basis B for V is an orthonormal set, then B is called an orthonormal basis. 


Summary 


Let V be a subspace in M n . 

Q From any basis B of V , the Gram-Schumidt orthogonalization produces an orthogonal basis B f for 

V. 


Problems 

Let S — { Vi, V2 } be the set of the following vectors in R 4 . 


"1" 


"0" 

0 

and v 2 — 

1 

1 

1 

_ 0 _ 


_ 0 _ 


Find an orthogonal basis of the subspace Span(S') of R 4 . 


(a) Solution 1. (Using the Gram-Schmidt process) 

It is straightforward to check that the vectors v 1, v 2 are linearly independent, and hence the set S is a 
basis of Span( 5 ) . 

Since the dot (inner) product of Vi and V2 is 

Vi • v 2 = 1 ^ 0, 

S is not an orthogonal basis. We apply the Gram-Schmidt process to generate an orthogonal basis from the 
basis S . 

The Gram-Schmidt process for two vectors is as follows. We define vectors Ui, U2 by the following 
formula. Then B — {ui, U2 } is an orthogonal basis of Span (S) . 






















ui := V! 

ui * v 2 

112 := V 2 -ui. 

ui • Ui 


Since we have 


Ui • v 2 = Vi • v 2 = 1 and ui • ui = vi • vi = 2, 


we compute 


1 

u 2 = v 2 - -ui 


o' 


"1" 

1 

1 

0 

1 

2 

1 

0 


0 


-i/: 

l 

1/2 

0 



Therefore the set 



is an orthogonal basis of Span (S) . 

Note that scaling by a nonzero scalar does not change the orthogonality, the set 



is also an orthogonal basis of Span (S) , just in case you prefer not to have a fraction. 

(a) Solution 2. (Using a pattern of the Gram-Schmidt process) 

Here is another solution using a partial information of the Gram-Schmidt process. 

As in Solution 1 , the set S is a (non-orthogonal) basis of Span( 5 *) . 

We want to apply the Gram-Schmidt process but suppose we only remember the pattern of the Gram- 
Schmidt process. Namely, we want to define orthogonal vectors u, u 2 by 


ui := vi 

u 2 := v 2 + au\. 
























Here a is some number, which is given in the Gram-Schmidt process (*) but we don’t remember. 
We can still determine the number a as follows. Since Ui and 112 will be orthogonal, we have 

0 = Ui • U2 = Ui • (v2 + <ZUi) 

= Ui • v 2 + ax 11 • ui 

— 1 H - 2 d. 


Hence, we obtain a — — 1 / 2 . 
Then we determine 


1 

U 2 = V 2 - -Ui 


o' 


'1" 

1 

1 

0 

1 

~~ 2 

1 

0. 


.0. 


'- 1/2 

1 

1/2 

0 



(So we could complete the Gram-Schmidt process even though we didn’t remember the details.) 
Hence the set 


/ 

- r 


1/2" 

\ 

/ 

0 


1 



1 

? 

1/2 

1 

< 

.0. 


. 0 . 

j 


is an orthogonal basis of Span (S) 
(b) Solution. 


We first express the vector 


as the linear combination 


3 

-2 


= 3 


= 3 e 1 — 2 e 2 . 


Then we compute 

























Therefore we have found 


r(3 ei - 2e 2 ) 

3T(ei) — 2r(e 2 ) by linearity of T 



'5' 


"8" 

3 

1 

- 2 

2 


1 

to 

I_ 


6 


'15' 


'16' 


'-1" 

3 

— 

4 

= 

-1 

. 6 . 


.12. 


.-6. 



Let W be a subspace of M 4 with a basis 



"l" 


o 


< 

0 


1 

> 


1 


1 



_ 1 _ 


_ 1 _ 

> 


Find an orthonormal basis of W. 
(The Ohio State University) 

Solution. 

Let 



"1 1 


‘O’ 


0 


1 

v i = 

1 

,V 2 = 

1 


1 

1 1 


1 

1 1 


Note that they are not orthogonal as the dot product is 


v 


l 


■1 1 


■O' 

0 


1 

1 


1 

1 

1 1 


1 

1 1 


= 1 • 0 + 0 • 


1 + 1 - 1 + 1 - 1 = 2 ^ 0 . 


Let us first find an orthogonal basis for W by the Gram-Schmidt orthogonalization process. 
Let Wj :=Vj. 

Next, let w 2 := v 2 + aw ^ where a is a scalar to be determined so that • w 2 = 0. 

(You may also use the formula of the Gram-Schmidt orthogonalization.) 

As and w 2 is orthogonal, we have 
































0 = Wj • W 2 = Vj • (v 2 + flVj) 
= V 1 •v 2 + av 1 • Vj 
= 2 + 3a. 


It follows that a = - 2/3 and 



■ 0 1 


■1 1 

2 

1 

2 

0 

3 V l = 

1 

~ 3 

1 


1 

■ ■ 


1 

■ ■ 


Now, to avoid fractions in our computation, let us consider 3w 2 , instead of w 2 . Note that the scaling 
does not change the orthogonality. 

We have 



■ 0 1 


-1 - 


'-2 ■ 


1 


0 


3 

3w 2 = 3 

1 

-2 

1 

= 

1 


1 

■ ■ 


1 

■ i 


1 

■ i 


Thus the set {w 1? 3w 2 } is an orthogonal basis for W. 

However, the length of these vectors are not 1 as we see 

Hwi 1 = yi 2 + 0 2 + 1 2 + l 2 = a/3 

I3w 2 II = 2) 2 + 3 2 + l 2 + l 2 = ^15. 


Now it suffices to normalize the vectors w 1? 3w 2 to obtain an orthonormal basis. 
Therefore, the set 


f 

■1 ‘ 


■-2 1 

\ 

1 1 

0 

1 

3 



1 

’ y /15 

1 



1 

1 1 


1 

■ i 

/ 


is an orthonormal basis for W. 

Comment. 

This is one of the midterm 2 exam problems for Linear Algebra (Math 2568) in Autumn 2017. 

One common mistake is just to normalize the vectors by dividing them by their length ^3. 

The resulting vectors have length 1, but they are not orthogonal. 

Another mistake is that you just changed the numbers in the vectors so that they are orthogonal. 
The issue here is that if you change the numbers randomly, then the new vectors might no longer 




















belong to the sub space W. 

The point of the Gram-Schmidt orthogonalization is that the process converts any basis for W to an 
orthogonal basis for W. 

The above solution didn’t use the full formula of the Gram-Schmidt orthogonalization. Of course, you 
may use the formula in the exam but you must remember it correctly. 


O 


Let A — 


1 

0 


0 1 
1 0 


(a) Find an orthonormal basis of the null space of A. 

(b) Find the rank of A. 

(c) Find an orthonormal basis of the row space of A. 
(The Ohio State University) 


Solution. 


First of all, note that A is already in reduced row echelon form. 


(a) Find an orthonormal basis of the null space of A. 


Let us find a basis of null space of A. 

The null space consists of the solutions of^4x = 0. 


Since A is in reduced row echelon form, the solutions x = 


x 2 

* 3 


satisfy 


hence the general solution is 


Xj = — and x 2 = 0, 



■-V 


'-I ' 

X = 

0 

= x 3 

0 


x 3 


1 


Therefore, the set 



is a basis of the null space of A. 

Since the length of the basis vector is^(-l) 2 + 0 2 +l 2 = ^2, it is not orthonormal basis. 
Thus, we divide the vector by its length and obtain an orthonormal basis 
















(b) Find the rank of A. 

From part (a), we see that the nullity of A is 1. The rank-nullity theorem says that 

rank of ^4 + nullity of A = 3. 

Hence the rank of A is 2. 

The second way to find the rank of A is to use the definition of the rank: The rank of a matrix B is the 
number of nonzero rows in a reduced row echelon matrix that is row equivalent to B. 

Since A is in echelon form and it has two nonzero rows, the rank is 2. 

The third way to find the rank is to use the leading 1 method. By the leading 1 method, we see that the 
first two columns form a basis of the range, hence the rank of A is 2. 


(c) Find an orthonormal basis of the row space of A. 

By the row space method, the nonzero rows in reduced row echelon form a basis of the row space of 
A. Thus 



is a basis of the row space of A. 

Since the dot (inner) product of these two vectors is 0, they are orthogonal. 
The length of the vectors is ^2 and 1, respectively. 

Hence an orthonormal basis of the row space of A is 


1 

72 


n roi 


Letvi — 


2/3 

2/3 

1/3 


be a vector in IRr. Find an orthonormal basis for WL 6 containing the vector Vi. 


The first solution uses the Gram-Schumidt orthogonalization process. 
On the other hand, the second solution uses the cross product. 

Solution 1 (The Gram-Schumidt Orthogonalization) 

First of all, note that the length of the vector Vj is 1 as 













We want to find two vectors v 2 , v 3 such that {v 1? v 2 , v 3 } is an orthonormal basis for R 3 . 
The vectors v 2 , v 3 must lie on the plane that is perpendicular to the vector Vj. 

So consider the subspace 



'x" 



'x " 


-2/3 1 


w = | 

y 

6 R 3 


y 


2/3 

= 0 

( 

z 



z 


1/3 



Note that W consists of all vectors that are perpendicular to v 1? hence W is a plane that is 
perpendicular to 


The relation 


can be written as 


or equivalently 


'x ' 


■2/3 ' 

y 


2/3 

z 


1/3 

2 

2 

1 


= 0 


~x + Tf + = 0, 

3 3 3 


z = ~ 2x ~ 2y. 


Hence the vectors in W can be written as 


'x " 


■ X 


■ 1 ■ 


■ 0 ■ 

n 'x; 

■ _ 

= 

y 

-2x - 2 y 

= X 

_ ■ 

+y 

1 

-2 


It follows that 



is a basis for the subspace W. Let us call these vectors u 1? u 2 , respectively. 


We apply the Gram-Schmidt orthogonalization to this basis {u 1? u 2 } and obtain an orthogonal basis as 
follows. 

We do not change the first vector: let w 1 = u } . 


























Next, we set 


w 2 = u 2 + aiii 


for some scalar a. 

To determine a , we compute 

0 = w j • w 2 = u j ■ (u 2 + au j) 

= u 1 • u 2 + au l • 

p= (1 • 0 + 0 • 1 + ( - 2) • ( - 2)) + a{\ • 1 + 0 • 0 + ( - 2) • ( - 2) 
= 4 + 5a. 


Hence, a = - 4/5 and we obtain 


Wn 


1 


0 

1 

-2 



(Note that you may use the Gram-Schumidt orthogonalization formula, instead of the above method.) 


As scaling does not change the orthogonality, consider 5w 2 , instead of w 2 (to avoids fractions). 
We have 



■ 0 ■ 


' 1 1 


'-4' 

5w 2 = 

5 

-10 

-4 

■ _ 

= 

5 

-2 


Therefore, {w 1? 5w 2 } is an orthogonal basis of W. 


We obtain an orthonormal basis of W by normalizing the length of these basis vectors. 
As 


llw! II = yi 2 + 0 2 + (-2) 2 = s\5 


and 


II5w 2 II = yj( - 4) 2 + 5 2 + (- 2) 2 * ^45 = 3^/5, 

the vectors 


W 1 

V2 ’ llw j II 


1 

T 5 

















and 


V3 ' I5w 0 ll 


5w 2 i 
2 " 3^/5 


-4 

5 

-2 


form an orthonormal basis of W. 

Note that as the vectors v 2 , v 3 lie in W , they are still perpendicular to the vector \ v 

It follows that {v 1? v 2 , v 3 } is an orthonomal set in R 3 , thus it is an orthonormal basis for R 3 . 

Solution 2 (Cross Product) 

Next, we solve the problem using the cross product. 


Letv 1 = -u 1? where = 


Our first goal is to find the vectors u 2 and u 3 such that {u 1? u 2 , u 3 } is an orthogonal basis for R 3 


Let x = 


be a vector that is perpendicular to u 1 . 


Then we have x • u l = 0, and hence we have the relation 

2x + 2y + z = 0. 


For example, the vector u 2 := 


1 

0 

-2 


satisfies the relation, and hence u 2 • Uj = 0. 


(So far, it is not so different from Solution 1.) 


Now, let us define the third vector u 3 to be the cross product of u 1 and u 2 


u 3 :=u l Xu 2 = 


By the property of the cross product, the vector u 3 is perpendicular to both u 1? u 2 . 
Therefore, the set 


■2- 


' 1 - 


■-4' 

2 

X 

0 

= 

5 

1 


-2 


-2 


{“l, U 2 , u 3> = 


-2 ' 


' 1 1 


'-4 " 

2 

? 

0 

? 

5 

1 


-2 


-2 























is an orthogonal basis for R 3 as it consists of three nonzero orthogonal vectors. 


Finally, to obtain an orthonormal basis for R 2 , we just need to normalize the lengths of these vectors. 
The lengths are 

llu, II = ^2 2 + 2 2 + l 2 = a/9 = 3 
llu 2 ll = Vl 2 + 0 2 + (-2) 2 = ^5 
lu 3 ll = V( - 4 ) 2 + 5 2 + (- 2 ) 2 = ^45 = 3^5. 


Then we have v, = -—- 
1 II U I II 

u 2 u 3 

Let v 9 := -—r and := -—-. 

Z II U 2 II 3 II U 3 II 


It follows that the set 


{ViV 2 V 3 } = 


j 1 

■ 

(N (N 

■_ 

1 

' 1 1 

0 

1 

_■ 


1 ^ 

1 

’ V5 

-2 

’ 3v/5 

-2 



is an orthonormal basis for R . 


Let S = {vi, V 2 ,..., V/,} be a set of nonzero vectors in M n . Suppose that S is an orthogonal set. 

(a) Show that S is linearly independent. 

(b) If k = n, then prove that S' is a basis for M n . 


Proof. 

(a) Show that S is linearly independent. 

Consider the linear combination 

ClVi + C2V2 + • • • + CkVk — 0. 
Our goal is to show that C\ — c<i — • • • — Ck — 0. 


We compute the dot product of v, and the above linear combination for each / = 1,2,... ,k : 

0 = v, • 0 

= V/ • (civi + c 2 v 2 H-h c k Vk) 

= ClVi ■ Vi + c 2 V; • v 2 H-h c k Vi ■ v k - 

As S is an orthogonal set, we have v, • v, — 0 if i 7 ^ /. 

Hence all terms but the z-th one are zero, and thus we have 












Since V; is a nonzero vector, its length || v z -1| is nonzero. 

It follows that Ci — 0 . 

As this computation holds for every i — 1, 2,..., k , we conclude that c\ — — • • • = Ck — 0. 

Hence the set S is linearly independent. 

(b) If k = n , then prove that S is a basis for W 7 . 

Suppose that k — n . Then by part (a), the set 5 consists of n linearly independent vectors in the 
dimension n vector space WL n . 

Thus, S is also a spanning set of R” , and hence S is a basis for J$L n . 


Determinants of Matrices 


Summary 


Let A, B be n X n matrices. 

A is nonsingular if and only if det(j 4 ) 7^ 0 . 

© det(AB) = det(;4)det(£). 

If A is invertible, then det(.A _1 ) = det(j4) _1 . 


Problems 

Let A and S ben x n matrices, where n is an integer greater than 1. Is it true that 

det(A + B) — det(vl) + det(B)? 

If so, then give a proof. If not, then give a counterexample. 

Solution. 


We claim that the statement is false. 

As a counterexample, consider the matrices 


A = 


1 0 
0 0 


and B = 


0 0 
0 1 


Then 


A + B = 


and we have 


det( J 4 “b B ) — 

On the other hand, the determinants of A and B are 


1 0 
0 1 


1 0 
0 1 


= 1 . 


det(A) = 0 and det(S) = 0 , 


and hence 


det(A) + det(B) = 0^1 = det(.4 + B). 

Therefore, the statement is false and in general we have 


det(j4 + B) 7 ^ det(A) + det(S). 


















Remark. 

When we computed the 2 X 2 matrices, we used the formula 


a b 
c d 


ad — be. 


This problem showed that the determinant does not preserve the addition. 
However, the determinant is multiplicative. 

In general, the following is true: 


det(AB) =det(i4)det(S). 


Let 



0 10 

T -jr x —3 

4 x 


Find all values of X such that A is invertible. 
(Stanford University) 



Calculate the determinant of the matrix A. 


Solution. 

A matrix is invertible if and only if its determinant is non-zero. 
So we first calculate the determinant of the matrix A. 


By the first column cofactor expansion, we have 

= 2 7 + 1 " 3 
4 x 

= 2 ((7 + x)x - (-3)4) = 2(x 2 + 7x + 12) 
= 2(x + Z)(x + 4). 

Thus the determinant of A is zero if and only if x — —3 or x — —4. 

Therefore the matrix A is invertible for all x except x — — 3 and x — — 4. 



Let 

'1 -x 0 0 

^ 0 1 —x 0 

0 0 1 -x 

_0 1 0-1 

be a 4 X 4 matrix. Find all values of x so that the matrix A is singular. 

Hint. 


Use the fact that a matrix is singular if and only if the determinant of the matrix is zero. 












To compute the determinant, use a cofactor expansion. 

Solution. 

We use the fact that a matrix is singular if and only if the determinant of the matrix is zero. 


We compute the determinant of A as follows. 


det(A) = 


1 -x 

0 1 
0 0 
0 1 


0 0 

—x 0 
1 —x 

0 -1 


1 

—x 

0 


—x 

0 

0 


—x 

0 

0 


—x 

0 

0 

0 

1 

—x 

- 0 

0 

1 

—x 

+ 0 

1 

—x 

0 

+ 0 

1 

—x 

0 

1 

0 

-1 


1 

0 

-1 


1 

0 

-1 


0 

1 

—x 


by the first column cofactor expansion 

1 —x 0 
= 0 1 —x 

1 0 -1 


1 

—x 

-0 

—x 0 

+ 

—x 0 

0 

-1 


0 -1 


1 —x 


by the first column cofactor expansion 

= —I + x 2 . 

Therefore we have det(^4) = x 2 — 1. Thus det(A) = 0 if and only if x — =bl. 
We conclude that the matrix A is singular if and only if x — db 1. 


Comment. 

You may use the rule of Sarrus to compute the 3 X 3 determinant instead of the cofactor expansion if 
you like so. 


Find all the values of X so that the following matrix A is a singular matrix. 


x x 2 1 



Use the fact that a matrix is singular if and only if its determinant is zero. 

Solution. 

Note that a matrix is singular if and only if its determinant is zero. 

So we compute the determinant of the matrix A as follows. 






















det(+) = 


= (- 1) 3+1 • 0 • 


XXL 

2 3 1 

0-11 

x 2 1 

3 1 


+ (- 1) 3+2 • (- 1 ) 


by the third row cofactor expansion 
= 0 + (x — 2) + (3x — 2x 2 ) 

= — 2a: 2 + 4x — 2. 

Thus the determinant of A is zero if 


+ (-l) 


3+3 


• 1 • 


det(^4) = — 2x 2 + \x — 2 = 0, 


x 2 

3 


equivalently, 

x 2 — 2x + 1 = (x — l) 2 = 0. 

Thus, the determinant of the matrix A is zero if and only if x — 1. 
Hence the matrix A is singular if and only if x — 1. 


Find the value(s) of h for which the following set of vectors 


/ 

"1" 


h 


1 

\ 

< Vi = 

< 

0 

. 0 . 

,V 2 = 

1 

_-h_ 

,v 3 = 

2 h 

_3/i + l. 

> 

> 


is linearly independent. 

(.Boston College) 

Solution 1. 

Let us consider the linear combination 

®lVi + x 2 v 2 + x 3 v 3 = 0 . (*) 

If this homogeneous system has only zero solution X\ = x 2 = £3 =0, then the vectors Vi , v 2 , V3 
are linearly independent. 

We reduce the augmented matrix for the system as follows. 


" 1 

h 

1 

0 " 

-R3 +/ll?2 
-» 

" 1 

h 

1 

0 " 

0 

1 

2h 

0 

0 

1 

2 h 

0 

0 

-/i 

3h + 1 

0 


_ 0 

0 

2h 2 + 3h + l 

0 _ 


From this, we see that the homogeneous system (*) has only the zero solution if and only if 

2h 2 + 3h + 1 ^ 0. 

Since we have 

2h 2 + 3h + 1 = (2 h + 1 )(h + 1), 
if h 7 ^ — ^, —1, then 2 h 2 + 3h + 1 7 ^ 0. 

In summary, the vectors Vi, v 2 , V3 are linearly independent for any h except — ^ — 1. 

























Solution 2. 

Note that the vectors Vi, V 2 , V 3 are linearly independent if and only the matrix 


A := [viv 2 v 3 ] 


1 h 1 
0 1 2 h 

0 -h 3h + l 


is nonsingular. 


Also, the matrix A is nonsingular if and only if the determinant det(j4) is nonzero. 

So we compute the determinant of the matrix A by the first column cofactor expansion and obtain 


det(^4) 


1 h 1 
0 1 2 h 

0 -h 3/t + l 

1 2 h 

—h 3/i + l 


= 2h 2 + 3h + 1 
= (2 h + 1 )(/i + 1). 

Hence det(A) ^ 0 if and only if h ^ , — 1. 

Therefore, the vectors Vi, V2, V3 are linearly independent for any values of h except j , — 1. 


Let 


"1 

0 

3 ' 


"2 

0 

o' 

4 

5 

6 

and B — 

0 

3 

0 

1 

0 

9. 


.0 

0 

4_ 


Then find the value of 


det (A 2 B~ 1 A~ 2 B 2 ). 

(Without a proof, you may assume that A and B are invertible matrices.) 

(a) Solution. 

To determine eigenvalues of A, we compute the determinant of A — XI. 
We have 


det (A - XI) 


3 - A -2 
6 -4- A 

(3 - A)(—4 - A) + 12 
A 2 + A = A(A + 1). 


The eigenvalues are solutions of det (A — XI) = 0, hence eigenvalues of A are 0, — 1. 


Next, we find the eigenvector corresponding to the eigenvalue A = 0. 
Eigenvectors x are nonzero solutions of (^4 — 0/)x = 0. 














Thus, we solve Ax = 0. The augmented matrix of the system is 


3 

-2 

0 

to 

i 

to 

3 

-2 

1 - 

o 

1 - 

-4 

l 

O 


0 

0 

- 1 

o 


Thus, if x 


xi 

X2 


is a solution, then X\ 


o 

j a? 2 , hence 


X = £ 2 


2/3 

1 


1 

0 



0 

0 


is an eigenvector corresponding to A = 0 for any nonzero scalar £2 • 


Finally, we find the eigenvectors corresponding to the eigenvalue A = — 1. 

In this case we need to solve (A — ( — l)/)x = 0. 

The augmented matrix is 

1 - 1/2 0 

0 0 0 


4 

-2 

0 


2 

-1 

0 

i?2 —R\ 

2 

-1 

l 

o 

I* 

6 

-3 

0 

- * 

\R2 

2 

-1 

0 

-> 

0 

0 

0 

-> 


It follows that the eigenvectors associated to A = — 1 are 


x = X 2 


' 1 / 2 ' 

1 


for any nonzero scalar X 2 . 

(b) Solution. 

We use the following two properties of determinants. 
Let C and Dben X n matrices. Then we have 


det(C'D) = det(C) det(Z>) 


and if C is invertible, then 


det(C' _1 ) = det((7) -1 


1 

det(C) ‘ 


Using the properties of determinants, we compute 

det (A 2 B~ 1 A~ 2 B 2 ) 

— det(j4) 2 det(S) -1 det( J 4) -2 det(l?) 2 

= det(j4) 2 det (A) 2 det(S) 1 det(l?) 2 (determinants are just numbers) 
= det(B). 


Hence it suffices to find the determinant of the matrix B. 

Since the matrix B is an upper triangular matrix, its determinant is the product of diagonal entries, 
thus 



































det (B) = 2 ■ 3 ■ 4 = 24. 


As a result, we obtain 

det (A 2 B- 1 A- 2 B 2 ) = 24. 


Let A be an n x n nilpotent matrix, that is, A m = O for some positive integer m, where O is the n x n zero 
matrix. 

Prove that A is a singular matrix and also prove that / — A, I + A are both nonsingular matrices, where I is the 
n X n identity matrix. 

Hint. 

We give two proofs. The first one uses only some properties of the determinant of a matrix. 

The second one uses the theory of eigenvalues. 

We summarize several facts we will use below. 


In proof 1. 

( ' A matrix is nonsingular if and only if it is invertible. 

Q A matrix is nonsingular if and only if its determinant is nonzero. 


For two square matrices A, B, we have 

det (AB) = det (A) det (B). 


In proof 2. 


0 


Eigenvalues of A are all scalars X such that the matrix A - XI is nonsingular. 


Proof 1. 

We use the fact that a matrix is nonsingular if and only if its determinant is nonzero. 
Since the determinant is multiplicative, we have 

0 = det ( O) — det ( A m ) = det ( A) m . 

This implies that det (A) = 0, and hence the matrix A is singular. 



Next, we show that the matrix I — A is nonsingular. Note that we have 

I = I- A m = (I-A)(I + A+A 2 + -+A m ~ X ). 

This implies that the matrix 1 +A + A 2 -t -f- A m 1 is the inverse matrix oil- A. 

Thus the matrix I — A is invertible, and hence it is nonsingular. 


Next, we show that the matrix / + A is nonsingular. 

Note that we have 

I-(-A) m = (/+ A)(/-A+A 2 -••• + (- l) m ~ 1 A m ~ 1 ). 

Since ( - A) m = ( l) m ^4 m = ( l) m O = O, we obtain 

/=(/ + ^)(/-^+^ 2 -- + (- 1 

and this implies that 1 + A is invertible matrix with the inverse matrix 

{I + A)~ X =I-A+A 2 -- + (- l) m ~ l A m ~ l . 

Therefore, the matrix / + A is nonsingular. 

Proof 2. 

In the second proof, we will use an eigenvalue technique. 

Let X be an eigenvalue of the matrix A and let x be the corresponding eigenvector. Namely, we have 

Ax = Xx. 


Then we have 


0 = Ox = A m x 

= A m ~ x Ax = = - = l m \. 


It follows that X = 0. Thus the only eigenvalue of A is zero. 

Recall that eigenvalues X are all scalars such that A— 2/ is a singular matrix. 

Since X = 0 is an eigenvalue of A, the matrix A = A - 0/ is singular. 

Since 1 is not an eigenvalue, we see that A— I is nonsingular, and hence I- A = - (A -1) is also 
nonsingular. 

Similarly, since -1 is not an eigenvalue, we observe that 1 + A = A+1 is nonsingular. 

Related Question. 

A natural source of nilpotent matrices is the differentiation linear transformation of vector space of 
polynomials. 

Check out the post “Differentiating Linear Transformation is Nilpotent” for problems and solutions 
about the nilpotency of the differentiation linear transformation. 


Related Question. 

Here is another problem about a nilpotent matrix. 


Problem. 

Suppose that A is an n x n nilpotent matrix and B is an n x n invertible matrix. 

Is the matrix B - A invertible? If so, give a proof. Otherwise, give a counterexample. 


Let A be a 3 X 3 matrix. Let x, y, z are linearly independent 3-dimensional vectors. Suppose that we have 


Ax — 

"l" 

0 

,Ay = 

"O' 

1 

, Az — 

"l" 

1 


1 


0 


1 


Then find the value of the determinant of the matrix A. 


Solution 1. 


Let B be the 3 x 3 matrix whose columns are the vectors x, y, z, that is, 

B = [xyz]. 


Then we have 


Then we have 


AB 


1 0 1 
0 1 1 
1 0 1 


det (A) det (B) = det (AB) 


1 0 1 
0 1 1 
1 0 1 


= 0. 


(If two rows are equal, then the determinant is zero. Or you may compute the determinant by the 
second column cofactor expansion.) 

Note that the column vectors of B are linearly independent, and hence B is nonsingular matrix. Thus 
the det (B) ^ 0. 

Therefore the determinant of A must be zero. 

Solution 2. 

Since 


"1 1 


'O’ 


■ 1 - 

0 

+ 

1 

= 

1 

1 


0 


1 



















we have 


It follows that we have 


Ax+Ay = Az. 


A(x + y - z) = 0. 

Since the vectors x, y, z are linearly independent, the linear combination x + y-z^0. 

Hence the matrix A is singular, and the determinant of A is zero. 

(Recall that a matrix A is singular if and only if there exist nonzero vector v such that Au = 0.) 


0 Determine the values of x so that the 


matrix 


A = 


1 1 x 

1 X X 
XXX 


is invertible. For those values of x , find the inverse matrix A 1 . 

Solution. 


We use the fact that a matrix is invertible if and only if its determinant is nonzero. 
So we compute the determinant of the matrix A. 


We have 


det (A) 


1 1 x 

1 x x 
xxx 


= 0 ) 


X X 
X X 




= (x 2 - x 2 ) - (x - x 2 ) + x(x - x 2 ) 
= (x - l)(x - x 2 ) 


= x(x - l) 2 . 


X 

X 


by the first row cofactor expansion. 


Thus, the determinant det (A) is zero if and only if x = 0, 1. 
Hence the matrix A is invertible if and only if x ^ 0, 1. 


Next, we suppose thatx ^ 0, 1 and find the inverse matrix of A 
We reduce the augmented matrix [A | 7] as follows. 

We have 












[A\I\ = 


r i 1 jc 1 o 

1 x x 0 1 

x x x 0 0 


O' 

0 

1 









"l 

1 

X 

1 

0 

0 

-1 

1 

X 

1 

0 

0" 

1 




-l 

1 


0 

x-l 

0 

-1 

1 

0 

x~ 1 A 

0 

1 

0 

X- 1 

X- 1 

0 

0 

0 

X - x 2 

-x 

0 

1 

1 

2 R 3 




-1 


1 







x-x J 

0 

0 

1 

1 -X 

0 

X-X 2 


1 0 x 


0 1 0 


0 0 1 


X 

-1 

X- 1 

X- 1 

-1 

1 

X- 1 

X- 1 

-1 

0 

1 -X 


10 0 0 


- 1 -X 


-1 1 

0 10 — — 0 

X-l X-l 


-1 1 

0 0 1 “— 0 - 7 

1-X r-r 2 


Now that we reduced the left 3x3 matrix into the identity matrix, the right 3x3 matrix is the inverse 
matrix of A. 

(Note that when we applied elementary row operations, we divided by x - 1 and x - x 2 , and this is 
where we needed to assume x ^ 0, 1.) 

We have 


0 


- 1 -X 

X-X 2 


A~' = 


-l l 

X-l X-l 


0 


-1 

1 -X 


1 

x(l -x) 


0 

X 

-x 


X 

-x 

0 


-x " 


0 

1 


<D Prove that if n 


X n matrices A and B are nonsingular, then the product AB is also a nonsingular matrix. 


(The Ohio State University) 


Definition (Nonsingular Matrix) 


Ann X n matrix is called nonsingular if the only solution x G R” of the equation Xx = x is 

x = 0 


Proof. 



















We give two proofs. The first one uses a property of the determinants of matrices, and the second one 
uses the definition of nonsingular matrices. 

Proof 1. (Using Determinant) 

Recall that a matrix is nonsingular if and only if its determinant is not zero. 

Since A and B are nonsingular, we know that 

det(A) 7 ^ 0 and det(B) ^ 0. 

Then we have using the multiplicative property of the determinant 

det(AB) = det(i4)det(B) ^ 0. 

Since the determinant of the product AB is not zero, we conclude that AB is a nonsingular matrix. 

Proof 2. (Using Definition of Nonsingular Matrices) 

Suppose that A, B are nonsingular matrices. 

This means that if Ax. — 0 for some the vector x £ R n , then we must have x = 0. 

Same for B. 

Suppose that we have (. AB)x — 0 for some vector x £ R n . 

Let v = Bx £ R n . Then we have 


Av = ABx = 0 . 


So since A is a nonsingular matrix, we have v = 0, namely, Bx — 0. 

Since B is nonsingular, this further implies that x = 0. 

In summary, whenever ( AB)x — 0, we have x = 0. 

Therefore, the matrix AB is nonsingular. 


<D Let n be an odd positive integer. Determine whether there exists an n X n real matrix A such that A 2 + / = O, 
where / is the n X n identity matrix and O is the n X n zero matrix. If such a matrix A exists, find an example. 
If not, prove that there is no such A. How about when n is an even positive number? 

Hint. 

The key technique to solve this problem very easily is determinant. 

Recall the following properties of the determinant. 

Let A, B be n X n matrices and c be a scalar Then we have 

$> det(AB) = det(A)det(S) 


G det(cA) = c”det(A). 



Solution. 

When n is odd. 

When n is odd, we prove that there is no n X n real matrix A such that A 2 + I — O. 
Seeking a contradiction, assume that we have A such that A 2 + I = O. 

Since we have A 2 = —I, we have 

det(A 2 ) = det(—/). 

Using the properties of determinant, we obtain 

det (A) 2 = (—l) n det (I) = -1 

because n is odd and det (/) = 1. 

Since A is a real matrix, the determinant of A is also real. 

Thus, det(^4 ) 2 = — 1 is impossible. Hence there is no such A. 


When n is even. 


On the other hand, if n is even there is A such that A 2 +1=0. 
For example, consider 


A = 


0 

1 



Then a direct computation shows that A 2 = — 7, hence A 2 + I = O. 


Comment. 


Recall that the imaginary number i is the number whose square is — 1. 

„ r° - ] 

Similarly, we found above that the square of the matrix A = 


is —I. 


But when n is odd, there is no such matrix A as we showed. 


0 Prove that the determinant of an n X n skew-symmetric matrix is zero if n is odd. 

Definition (Skew-Symmetric) 

A matrix A is called skew-symmetric if A T = — A. 

Here A T is the transpose of A. 

Proof. 

Properties of Determinants 

We will use the following two properties of determinants of matrices. 

For any n X n matrix A and a scalar c, we have 







^)det(^4) = det(A T ), 


^)det(cA) = c”det(A). 


Main Part of the Proof 

Suppose that n is an odd integer and let A be an n X n skew-symmetric matrix. 
Thus, we have 

A t = -A 


by definition of skew-symmetric. 
Then we have 


det(^4) = det(yl T ) 

= det(— A) 

= (-l) n det (A) 
— — det(-A) 


by property 1 

since A is skew-symmetric 

by property 2 

since n is odd. 


Therefore, it yields that 2 det(A) = 0, and hence det(A) = 0. 


Comment. 


The result implies that every odd degree skew-symmetric matrix is not invertible, or equivalently 
singular. 


Also, this means that each odd degree skew-symmetric matrix has the eigenvalue 0. 

Related Question. 

The eigenvalues of a real skew-symmetric matrices are of the special form as in the next problem. 


Problem, (a) Prove that each eigenvalue of the real skew-symmetric matrix A is either 0 or a 
purely imaginary number. 

(b) Prove that the rank of A is even. 




Computations of Determinants 


Problems 

Find the determinant of the matix 


100 

101 

102 

101 

102 

103 

102 

103 

104 


Solution. 

Note that the determinant does not change if the 2-th row is added by a scalar multiple of the j -th row 
if i / j. 

We use this fact about the determinant and compute det(^4) as follows. 


det(^4) 


100 

101 

102 

101 

102 

103 

102 

103 

104 

100 

101 

102 

101 

102 

103 

1 

1 

1 

100 

101 

102 

1 

1 

1 

1 

1 

1 

100 

101 

102 

1 

1 

1 

0 

0 

0 


(by Rz- R2) 


(by R2- R\) 


(by Rz- Ri) 


— 0 (by the third row cofactor expansion.) 


Therefore the determinant det (/l) is zero. 















Introduction to Eigenvalues and Eigenvectors 


Definition 


Let A be an n X n matrix. 

A scalar A is called an eigenvalue of A if the equation Ax = Ax has a nonzero solution x. 

Such a nonzero solution x is called an eigenvector corresponding to the eigenvalue A. 

Q The characteristic polynomial of A is the degree n polynomial p{t) = det(A — tl). 

Q If p(t) — (t — Ai) ni • • • {t — \k) nk is a factorization of the characteristic polynomial of A , where 
Ai,. . ., Ajfe are distinct eigenvalues of A, then the algebraic multiplicity of the eigenvalue A* is n*. 


Summary 


Let A be an n X n matrix. Letp(t) be the characteristic polynomial of A. 

Q The degree of p(t) is n. 

Q A is an eigenvalue of A if and only ifp(A) = det(^4 — XI) — 0. 

A has at least one eigenvalue and has at most n distinct eigenvalues. 

A has at most n distinct eigenvalues. 

The eigenvalues of a matrix A are roots of the characteristic polynomial of A. 
The eigenvalues of a triangular matrix are diagonal entries. 


Problems 


(a) True or False. If each entry of an n X n matrix A is a real number, then the eigenvalues of A are all real 
numbers. 

(b) Find the eigenvalues of the matrix 


B = 


-2 

5 



(The Ohio State University) 

Hint. 

Consider a 2 X 2 matrix. 

Then the eigenvalues are solutions of a quadratic polynomial. 


Does a quadratic polynomial always have real solutions? 

Solution. 


















(a) True or False. If each entry of an n X n matrix A is a real number, then the eigenvalues of 
A are all real numbers. 

False. In general, a real matrix can have a complex number eigenvalue. In fact, the part (b) gives an 
example of such a matrix. 

(b) Find the eigenvalues of the matrix 
The characteristic polynomial for B is 

det(i? — tl ) = 

The eigenvalues are the solutions of the characteristic polynomial. Thus solving t 2 + 1 = 0, we 
obtain eigenvalues =bz, where i — y/—l- 

Thus the eigenvalue of a real matrix B is pure imaginary numbers =b i. 


= r +1. 


A 

Find all the eigenvalues and eigenvectors of the matrix A 


3 -2 
6 -4 


(a) Solution. 


To determine eigenvalues of A, we compute the determinant of A — XI. 
We have 


det (A - XI) 


3 - A -2 
6 -4-A 

(3 — A)(—4 — A) + 12 
A 2 + A = A(A + 1). 


The eigenvalues are solutions of det (A — XI) — 0, hence eigenvalues of A are 0, — 1. 


Next, we find the eigenvector corresponding to the eigenvalue A = 0. 
Eigenvectors x are nonzero solutions of (^4 — 0/)x = 0. 

Thus, we solve Ax = 0 . The augmented matrix of the system is 


' 3 

-2 

0 

R 2 —2R\ 

" 3 

-2 

1 - 

-4 

0 

- > 

O 

0 


1 - 2/3 

1 - 

O 

O 

O 

_1 

- 1 

O 


Thus, if x 


xi 

.X 2 _ 


is a solution, then X\ 


9 

- 3 X 2 , hence 


x = x 2 


2/3 

1 


is an eigenvector corresponding to A = 0 for any nonzero scalar X 2 ■ 
























Finally, we find the eigenvectors corresponding to the eigenvalue A = — 1. 
In this case we need to solve (^4 — (—l)/)x = 0. 

The augmented matrix is 


- 1 

-2 

0 " 

-> 

1 

to 

-1 

0 " 

i?2 R\ 
- > 

' 2 

-1 

1 - 

-3 

l 

o 

1*2 

1 

to 

-1 

1 

o 


0 

0 


" 1 -1/2 

1- 

o 

0 0 

-1 

o 


It follows that the eigenvectors associated to A = — 1 are 



for any nonzero scalar x 2 • 


Let 


0 0 0 0 
1111 
0 0 0 0 
1111 


Find the eigenvalues of the matrix A. Also give the algebraic multiplicity of each eigenvalue. 


Solution (a). Eigenvalues of A and algebraic multiplies 


Eigenvalues and their algebraic multiplicities are determined by the characteristic polynomial p(t) of 
A. 

By definition, the characteristic polynomial of A is p(t) = det (A - tl). 

We have 


p(t) = det (A - tl) 

-t 0 0 0 

11-/1 1 
0 0-/0 
1 1 11 -/ 


1 -/ 1 


= -/ 


0 

1 


-/ 

1 


1 

0 

1 -/ 


by the first row cofactor expansion 


by the second row cofactor expansion 

= ? 2 ((i-0 2 -i) 

= t\t 2 - 21 ) 

= t\t~ 2). 


= - / -/ 


1 - / 1 
1 1 -/ 


Thus the characteristic polynomial is 



























p(t) = t\t- 2). 

From this, the eigenvalues of A are 0 and 2 with algebraic multiplicities 3 and 1, respectively. 

Solution (b). Geometric multiplicity 

We give two solutions for part (b). 

First Solution (b). (Finding the rank first) 

Recall that the geometric multiplicity of X is the dimension of the eigenspace E } = J\f(A - XI). That is, 
the geometric multiplicity of X is the nullity of the matrix A - XI. 

Let us now consider the case X = 0. 

We first find the rank of A - 0/ = A as follows. 


A - 01 = A = 


■o 

0 

0 

0 1 


■o 

0 

0 

0 1 


■1 

1 

1 

1 ■ 

1 

1 

1 

1 

r 4 -r 2 

1 

1 

1 

1 


0 

0 

0 

0 

0 

0 

0 

0 


0 

0 

0 

0 


0 

0 

0 

0 

1 

■ 

1 

1 

1 

■ 


0 

■ 

0 

0 

0 

■ 


0 

■ 

0 

0 

0 

■ 


The last matrix is in reduced row echelon form. 

Hence the rank of A is 1. 

The rank-nullity theorem says that 

rank of ^4 + nullity of ^4 = 4. 

Thus, the nullity of A = A - 01 is 3, and hence the geometric multiplicity of X = 0 is 3. 


Second Solution (b). (Finding a basis of the eigenspace) 

In this solution, we find a basis of the eigenspace E Q . 

By definition E 0 = J\f(A - 01) = J\f(A). 

Thus, the eigenspace E 0 is the null space of the matrix^. 
We solve the equation Ax = 0 as follows. 

The augmented matrix of this equation is 


[-4 10 ] = 


■o 

0 

0 

0 

O' 


-o 

0 

0 

0 

O' 


■1 

1 

1 

1 

O' 

1 

1 

1 

1 

0 

r 4 -r 2 

1 

1 

1 

1 

0 

R i > R 2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

-* 

0 

0 

0 

0 

0 

-* 

0 

0 

0 

0 

0 

1 

1 

1 

1 

1 

0 

1 


0 

■ 

0 

0 

0 

0 

1 


0 

■ 

0 

0 

0 

0 

■ 


Hence the solution satisfies 


“ Xj - X A 


and the general solution is 















Therefore, the eigenspace is 
E 0 = MA) 


1 4 


'-l 1 

1 


■-1 ■ 

0 


‘-1 1 

0 

\ 

X e C 4 

x = x 2 

0 

0 

1 1 

+ x 3 

1 

0 

■ ■ 

+ x 4 

0 

1 

1 1 

, for any x 2 , jc 3 , x 4 eC 

/ 


= Span 



Thus the set 



is a spanning set of Eq, and it is straightforward to check that the set is linearly independent. 
Hence this set is a basis of E 0 , and the dimension of Eq is 3. 

The geometric multiplicity of A = 0 is the dimension of E 0 by definition. 

Thus, the geometric multiplicity of X is 3. 


Let 


A = 


a 

1 


-1 

4 


be a 2 X 2 matrix, where a is some real number. Suppose that the matrix A has an eigenvalue 3. 


(a) Determine the value of a. 

(b) Does the matrix A have eigenvalues other than 3? 


Solution. 

(a) Determine the value of a 


































Since 3 is an eigenvalue of the matrix A, we have 

0 = det(A-31), 

where I is the 2 X 2 identity matrix. 

Thus we have 

0 = det(A - 37) 
a- 3 -1 

1 4-3 

a- 3 -1 

1 1 

= (« — 3 )( 1 ) -(-!)(!) = «- 2 . 


Thus the value of a must be 2. 

(b) Does the matrix A have eigenvalues other than 3? 
Let us determine all the eigenvalue of the matrix 



We compute the characteristic polynomial p(t) = det(A — tl ) of A. 
We have 


p(t) = det(A — tl ) 

_ 2 -t -1 
1 4 -t 

= (2-f)(4-f)-(-l)(l) 

= t 2 - 6t + 9 
= (t- 3) 2 . 

Since the eigenvalues are roots of the characteristic polynomial, solving (f — 3) 2 = Owe see that 
t — 3 is the only eigenvalue of A (with algebraic multiplicity 2). 

Hence the matrix A does not have eigenvalues other than 3. 


^ Determine all eigenvalues and their algebraic multiplicities of the matrix 


1 


A = 


a 


1 


a 1 
1 a 
a 1 


where a is a real number. 


Proof. 

(a) A + c is an eigenvalue ol\4 + cl. 

Let x be an eigenvector corresponding to the eigenvalue A. Then we have 


Ax = Ax. 












It follows that we have 

(A + c/)x = Ax + cx 
= Ax + cx 
= (A + c)x. 

Thus, we obtain 

(.A + c/)x = (A + c)x, 


where x is a nonzero vector. 

Hence A + c is an eigenvalue of the matrix A + cl, and x is an eigenvector corresponding to 
A — c. 

In summary, if A is an eigenvalue of A and x is an associated eigenvector, then A + c is an 
eigenvalue of A + cl and x is an associated eigenvector corresponding to A + c. 

(b) Algebraic multiplicities are the same 

Let p(t) — det (A — tl ) be the characteristic polynomial of A. 

Let q(t ) be the characteristic polynomial of the matrix A + cl. 

Then we have 

q(t) — det ( ( A + cl) — t) — det ( A — (t — c)) 

= p(t - C ). 

Let Ai,..., Afc be distinct eigenvalues of A with algebraic multiplicities m,..., respectively. 
Then we have 


k 

P (t) = ±f[(t - \r. 

i =1 

It follows that we have 

q(t) = p{t - c) 

k 

= ±l[(t-c-\ i r 

i =1 

=±n(t-(A i +c))”‘. 

i= 1 

From the last equation, we read that the eigenvalues of the matrix A + cl are A i + c with algebraic 
multiplicity ni for i — 1 ,. .., k. 

Thus, geometric multiplicities of A and A — c are the same. 

(c) How about geometric multiplicities? 

From part (a), we know that eigenvectors of A are eigenvectors of A — c. 

Reversing the argument, the eigenvectors of A + c are eigenvectors of A. 

Thus, the eigenvectors correspond one to one, and the eigenspace of A is the same as the eigenspace 
of A + c. 


Hence their geometric multiplicities are the same. 

Applications 

As applications of this problem, consider the following problems. 


Problem. Find all the eigenvalues and eigenvectors of the matrix 

'3 9 9 9' 

„ 9 3 9 9 

A — 

9 9 3 9 

9 9 9 3 


This is a problem of a Linear Algebra final exam at Harvard University. 

For a solution of this problem, see the post “Eigenvalues and eigenvectors of matrix whose diagonal 
entries are 3 and 9 elsewhere". 


Problem. Find the determinant of the following matrix 


A = 


6 

2 

2 

2 

2 


2 2 
6 2 
2 6 
2 2 
2 2 


2 2 
2 2 
2 2 
6 2 
2 6 


This is also a problem of a Linear Algebra final exam at Harvard University. 

See the post “Determinant of matrix whose diagonal entries are 6 and 2 elsewhere” for a solution. 


is an eigenvector of a matrix A corresponding to the eigenvalue 3 and that 

eigenvector of A corresponding to the eigenvalue —2. Compute A 2 
(<Stanford University ) 



Suppose that 


is an 


Solution. 

By the given conditions, we have 


' 1 ' 

= 3 

' 1 ' 

and A 

'2' 

= -2 

'2' 

.1. 


.1. 


.1. 


.1. 


We express the vector 


4 

3 


as a linear combination of vectors 



(*) 


a 


1 

1 



4 

3 


and we want to determine scalars a and b. The augmented matrix of the system is reduced as follows: 
































1 2 

4 " 


1 0 

2 






1 1 

3 


0 1 

1 


Thus the solution is 


a = 2,b=l 


and hence we obtain the linear combination 



Before computing A 2 


, we first calculate A 


= 2 

= 2 A 

= 2-3 

= 6 1 
1 


+ A 

+ (- 2 ) 

-2 2 

1 


as follows. 


by (*) 


Now, using the result of the above computation, we compute 


Therefore we obtained 


= A 
= A | 6 
= 6 A 

= 6-3 

26 
22 


-2 2 
.1 

- 2 A \ 2 
1 _ 

— 2 (— 2 ) 


by (*) 


'4' 


'26' 

.3. 


.22. 


Suppose that A is an n x n matrix with eigenvalue A and corresponding eigenvector v. 

(a) If A is invertible, is v an eigenvector of A~^l If so, what is the corresponding eigenvalue? If not, explain why 
not. 

(b) Is 3v an eigenvector of Al If so, what is the corresponding eigenvalue? If not, explain why not. 

(Stanford University) 

Hint. 

























































Use the defining relation Ax = Xx. 

Solution. 

(a) If A is invertible, is v an eigenvector of A ~ 

The answer is yes. First note that the eigenvalue X is not zero since A is invertible. 

By definition, we have Ax = Xx. Multiplying it by A ~ 1 from the left, we have 

x = XA~ l x. 

As noted above, X is not zero, so we divide this equality by X and obtain 

, 1 
A y ~r- 

Since v is not a zero vector, this implies that v is an eigenvector corresponding to the eigenvalue 1 IX 
of A. 

(b) Is 3v an eigenvector of A? 

The answer is yes. We calculate 

A(3v) = 3Av = 32v = X(3x). 

Thus we have A(3v) = 2(3 v). 

Since 3v ^ 0, this implies that 3v is an eigenvector corresponding to the eigenvalue X. 


O 


Let A be a 2 X 2 real symmetric matrix. Prove that all the eigenvalues of A are real numbers by considering the 
characteristic polynomial of A. 


Proof. 


Let A 


a 

c 


b‘ 

d 


T 

Then as A is a symmetric matrix, we have A = A. 


This implies that 


" a 

b 

c " 

d 

= 

■_ 

b ' 

d 







Hence we have b = c by comparing entries. 


Now, we find the characteristic polynomial p(t) of A. 
We have 








pit) = det (A - tl) 


a -1 b 
b d ~ t 


= (a-t)(d-t)-b 2 
= t 2 — (a + d)t + ad - b 2 . 


Note that the eigenvalues of A are roots of the characteristic polynomial p(t). Hence, it suffices to 
show that the roots of p(t) are real numbers. 

The quadratic polynomial has only real roots if and only if its discriminant is non-negative. 

The discriminant of p(t) is given by 

(< a + d ) 2 - 4 (ad - b 2 ) = a 2 + 2 ad + d 2 - 4ad + 4 b 2 
= a 2 - lad + d 2 + 4b 2 
= (a -d ) 2 + 4b 2 . 


Observe that the last expression is the sum of two squares of real numbers. Hence the discriminant of 
p{t) is nonnegative. 

We conclude that every 2x2 symmetric matrix has only real eigenvalues. 

Remark 

We also could find the eigenvalues directly. By the quadratic formula, the eigenvalues of A are 

a + d± ^ f(a + d ) 2 - 4(ad - b 2 ) a + d± ^j(a - d ) 2 + 4 b 2 
~2 “ 2 

and as the number inside the square root (discriminant) is positive, we conclude that the eigenvalues 
are real. 


Q Let 



b 

a 


be a 2 X 2 matrix, where a, b are real numbers. Suppose that 6^0. Prove that the matrix A does not have real 
eigenvalues. 

Proof. 

Let A be an arbitrary eigenvalue of A. 

Then the matrix A — XI is singular, where I is the 2 X 2 identity matrix. 

This is equivalent to having det [A — XI) = 0. 


We compute the determinant as follows. 
We have 










det {A - XI) 


a — A b 
—b a — A 


= (a - A) 2 - b(-b) 

= a 2 - 2aA + A 2 +b 2 
— X 2 - 2a\ + a 2 +b 2 . 


We solve the equation A 2 — 2aA + a 2 + b 2 = 0 by the quadratic formula and obtain 


2 a± yjA. a 2 — 4(a 2 + W) 2 a ± \/-46 2 

A — -—- — -—- 

2 2 

= a ± |&|i. 

Since b ^ 0 by assumption, the eigenvalue A = a ± \b\i is not a real number. 

As A is an arbitrary eigenvalue of A, we conclude that all eigenvalues of A are not real numbers. 



A = 



-12 

0 

5 


4 

-2 

-1 


Then find all eigenvalues of A 5 . If A is invertible, then find all the eigenvalues of A 1 


Proof. 


We first determine all the eigenvalues of the matrix A. 
The characteristic polynomial p(t ) of A is given by 


p(t) — det (A — tl ) 

3 - t -12 4 

= -1 -t -2 

-1 5 -l-t 


Using the first row cofactor expansion, we compute 

-1 -t 
-1 5 

— (3 — t)(f 2 + t + 10) 12(t — l) + 4(—5 — t) 

= -f 3 + 2 1 2 + 8t - 2. 


p(t) = (3 - t) 


-2 

-l-t 


~ (-12) 


-2 

-l-t 


+ 4 


Therefore the characteristic polynomial of A is 

p(t) — — T" 2T" 8 1 — 2 

and it can be factored as 

p(t) = -(t - 2 )(t - l)(t + 1). 


















The roots of the characteristic polynomials are all the eigenvalues of A. 
Thus, 2, dzl are the eigenvalues of A. 


To find the eigenvalues of A 5 , recall that if A is an eigenvalue of A , then A 5 is an eigenvalue of A 5 . 
It follows from this fact that 2 5 , (—l) 5 , l 5 are eigenvalues of A 5 . 

Since A 5 is a 3 X 3 matrix, its characteristic polynomial has degree 3, hence there are at most 3 
distinct eigenvalues of /4 5 . 

Because we have found three eigenvalues, 32, —1,1, of A 5 , these are all the eigenvalues of A 5 . 


Recall that a matrix is singular if and only if A = 0 is an eigenvalue of the matrix. 

Since 0 is not an eigenvalue of A, it follows that A is nonsingular, and hence invertible. If A is an 
eigenvalue of A, then y is an eigenvalue of the inverse A ~ l . 

So y, A = 2, ±1 are eigenvalues of A~ l . 

As above, the matrix A~ l is 3 X 3, hence it has at most three distinct eigenvalues. We have found 
1/2, ±1 are eigenvalues of A~ l , hence these are all the eigenvalues of A~ l . 


In summary, all the eigenvalues of A 5 are ±1, 32. The matrix A is invertible and all the eigenvalues 
of A~ l are ±1,1/2. 

Comment. 

Do not try to compute A 5 and A~ l and then find their eigenvalues. 

It will be tedious for hand computation. 


Let A be an eigenvalue of n X n matrices A and B corresponding to the same eigenvector x. 

(a) Show that 2 A is an eigenvalue of A + B corresponding to x. 

(b) Show that A 2 is an eigenvalue of AB corresponding to x. 

(The Ohio State University) 

Proof. 

(a) Show that 2/1 is an eigenvalue of A + B corresponding to x. 

Since A is an eigenvalue of A and B , and x is a corresponding eigenvector, we have 

Ax = Ax and Bx = Ax. 


Then we compute 


(A + B)x = Ax+ Bx 

= Ax + Ax by (*) 
= 2 Ax. 


Since x is an eigenvector, it is a nonzero vector by definition. 
Hence from the equality 



(A + B)x = 2Ax, 

we see that 22 is an eigenvalue of the matrix A + B and x is an associated eigenvector. 

(b) Show that A 2 is an eigenvalue of AB corresponding to x. 

We have 


(AB)x = A(Bx) 


= A(Ax) 

by (*) 

= A(Ax) 


= A(Ax) 

by (*) 

= A 2 x. 



Since x is a nonzero vector as it is an eigenvector, it follows from the equality 

(AB)x = 2 2 x 

that A is an eigenvalue of the matrix AB and x is a corresponding eigenvector. 


© Find all eigenvalues of the following n x n matrix. 


A = 


"0 0 
1 0 
0 1 


0 1 " 
0 0 
0 0 


0 0 ••• 1 0 



© Use the 1st row cofactor expansion 


0 For each smaller determinant matrix, use cofactor expansion inductively. 


Solution. 

We calculate the characteristic polynomial det(A — XI) of A. 
We use the cofactor expansion corresponding to the first row. 






-X 

0 ••• 

0 

1 


1 

-A ••• 

0 

0 

det(^4 - XI) = 

0 

1 

0 

0 


0 

0 ••• 

1 

-A 


-A 

0 ••• 

0 

0 


1 

-A ••• 

0 

0 

1 

-A ••• 

0 

0 


0 

1 

0 

0 

0 

1 

0 

0 

+ (-l) n+1 

0 

0 ••• 

1 

-A 

0 

0 ••• 

1 

-A 


0 

0 ••• 

0 

1 


Now we calculate the two determinants in the second equality separately. 

The easiest way is to note that the determinant of a triangular matrix is the product of its diagonal 
entries. 

Thus the first determinant is ( —A) n_1 and the second determinant is 1. 

(If you don’t know this fact, then use the first row cofactor expansion inductively to compute the first 
determinant. For the second one, use the first column cofactor expansion inductively.) 

Thus we obtain det( A - XI) = (~l) n X n + (-l) n+1 = (-l) n (A n - 1). 

Therefore eigenvalues are n-th roots of unity e 2m '" for i = 0,1,..., n — 1. 

Comment. 

The original determinant is not in a good shape for induction but once we apply the 1 st row cofactor 
expansion the smaller determinants obtained are better suited for induction. 

When I say in the proof “inductively”, I meant that you need to use mathematical induction to prove 
the claim (more) rigorously. 


<E> 


Find all the eigenvalues of the matrix A 


(The Ohio State University) 


0 10 0 
0 0 10 
0 0 0 1 
10 0 0 


Solution. 


We compute the characteristic polynomial p(t) of the matrix A as follows. 
We have 
















p(t) = det (A - tl) 

-t 10 0 

0-/10 


0 

0 

-t 

1 





1 

0 

0 


t 





-t 

1 

0 


0 

1 

0 

-1 

0 

-t 

1 

- 

0 

-t 

1 


0 

0 

-t 


1 

0 

-t 


by the first row cofactor expansion. 

The left determinant of the 3 x 3 matrix in (*) is ( - /) 3 since it is a diagonal matrix. 
We apply the first column cofactor expansion to the right determinant in (*) and obtain 

0 1 0 

1 0 

0 -/ 1 = =i 

-t 1 

1 0 -t 

It follows from (*) that 

pif) = (-*)(-o 3 - i*ab 4 -1- 

The eigenvalues of A are the roots of the characteristic polynomial p(t). 

Solving t 4 - 1 = 0, we obtain the eigenvalues 

± 1, ± z, 

where i = yj- 1. 

Note that t 4 - 1 = (/ - 1)(7 + 1)(/ - i){t + /). 


0 Let A be an n X n matrix. We say that A is idempotent if A 2 — A. 

(a) Find a nonzero, nonidentity idempotent matrix. 

(b) Show that eigenvalues of an idempotent matrix A is either 0 or 1. 
(The Ohio State University) 

Proof. 

(a) Nonzero, nonidentity idempotent matrix 


Let A = 


■o 

0 


1 “ 

1 . Then A is a nonzero, nonidentity matrix and A is idempotent since we have 


■0 

nr 

0 1 ■ 


■0 

1 ■ 

0 

'][ 

0 1 


0 

1 


(b) Eigenvalues of an idempotent matrix^ is either 0 or 1 



















Let X be an eigenvalue of the idempotent matrix A and let x be an eigenvector corresponding to the 
eigenvalue X. 

Namely we have 


Ax = Xx, x ^ 0. 


Then we compute A 2 x in two ways. 

First, since A is idempotent we have A 2 = A and thus we have 


A 2 x = Ax = Xx. 


Next, we compute as follows. 

(*) (*) 

A 2 x = A(Ax) = A(Xx) = X(Ax) = X(Xx) = X 2 x. 

Comparing these two computations, we obtain 

Xx = X 2 x. 

Since x is a nonzero vector (because x is an eigenvector), we must have 

Hence solving X(X - 1) = 0, the possible values for X is either 0 or 1. 

Thus, the idempotent matrix A only have eigenvalues 0 or 1. 

G A square matrix A is called idempotent if A 2 — A. 

(a) Let u be a vector in M n with length 1 . 

Define the matrix P to be P — uu T . Prove that P is an idempotent matrix. 

(b) Suppose that u and v be unit vectors in M n such that u and v are orthogonal. 

Let Q — uu T + vv T . Prove that Q is an idempotent matrix. 

(c) Prove that each nonzero vector of the form au + 6v for some a, b E M is an eigenvector corresponding to the 
eigenvalue 1 for the matrix Q in part (b). 


Proof. 


(a) Prove that P = uu T is an idempotent matrix. 


The length of the vector u is given by 



Since Hull = 1 by assumption, it yields that 


Let us compute P 2 using the associative properties of matrix multiplication. 
We have 


P 2 = (uu T )(uu T ) 

= u(u T u)u T 
= u(l)u T = UU T = P. 

Thus, we have obtained P 2 = P, and hence P is an idempotent matrix. 

(b) Prove that Q = uu T + w T is an idempotent matrix 
Since u and v are unit vectors, we have as in part (a) 

u T u = 1 and v T v = 1 . 

Since u and v are orthogonal, their dot (inner) product is 0. 

Thus we have 


U T V = V T U = o 

Using these identities, we compute Q 2 as follows. 

We have 

Q 2 = (uu T + vv T )(uu T + vv T ) 

= uu T (uu T + vv T ) + vv T (uu T + vv T ) 

= uu T uu T + uu T vv T + vv T uu T + vv T vv T 

r v v r 

= uu T + vv T = Q. 


It follows that Q is an idempotent matrix. 

(c) Prove that an + bx is an eigenvector 
Let us first compute Qu. We have 

Qu = (uu T + vv T )u 
= uu T u + vv T u = u. 

Y V 

Note that u is a nonzero vector because it is a unit vector. 

Thus, the equality Qu = u implies that 1 is an eigenvalue of Q and v is a corresponding eigenvector. 

Similarly, we can check that v is an eigenvector corresponding to the eigenvalue 1. 

Now let au + bx be a nonzero vector. 

Then we have 


Q(au + bx) = aQu + bQx = au + bx. 


It follows that au + bx is an eigenvector corresponding to the eigenvalue 1. 
Another way to prove (c) 



Another way to see this is as follows. 

As we saw above, the vectors u and v are eigenvectors corresponding to the eigenvalue 1. 
Hence u, v e E 1? where E x is an eigenspace of the eigenvalue 1 

Note that E x is a vector space, hence au + b\ is a nonzero vector in E v 
Thus, au + by is an eigenvector corresponding to the eigenvalue 1 as well. 

Related Question. 

Problem. 

(a) Find a nonzero, nonidentity idempotent matrix. 

(b) Show that eigenvalues of an idempotent matrix A is either 0 or 1. 


0 Let A be an n X n matrix. Suppose that the matrix A 2 has a real eigenvalue A > 0. Then show that either y/X 
or — y/X is an eigenvalue of the matrix A. 

Hint. 

Use the following fact: a scalar A is an eigenvalue of a matrix A if and only if 

det (A - A/) = 0. 


Proof. 

Since A is an eigenvalue of A 2 , the determinant of the matrix A 2 — XI is zero, where I is the n X n 
identity matrix: 


det {A 2 - XI) = 0. 

Now we have the following factorization. 

A 2 - XI = (A - VA I){A + y/X I). 


Taking the determinant of both sides, we obtain 

0 = det(A 2 - XI) - det ((A - V\I){A + \f\I)) 
= det (A - VX/) det (A + y/X I) 


by the multiplicative property of the determinant. 
Therefore we have either 


det(A — V^I) — 0 or det(A + x/XI) — 0 
Thus we conclude that either y/X or — y/X is an eigenvalue of the matrix A. 


Show that if A and B are similar matrices, then they have the same eigenvalues and their algebraic multiplicities 
are the same. 

Proof. 

We prove that A and B have the same characteristic polynomial. Then the result follows immediately 
since eigenvalues and algebraic multiplicities of a matrix are determined by its characteristic 
polynomial. 

Since A and B are similar, there exists an invertible matrix S such that S' -1 AS — B. 

Let Pa (t) and PB(t) denote the characteristic polynomials of A and B , respectively. 

We have 

p B (t ) = det (B - tl ) = det (S^AS - tl) 

= det (S~\A - tI)S ) = det(5 _1 ) det(A - tl)det(S) 

— det(^4 — tl) — PA(t). 

Here the fifth equality (*) follows from the fact that det(S' _1 ) = det (S') -1 . 

(Also note that even though matrix multiplication is not commutative in general but determinants are 
just numbers, thus we can change the order of the product of determinants.) 

Thus we showed that 2^4 (i) = ps(t) and this completes the proof. 


matrix A by 

— sin 9 0 
cos# 0 

0 1 

(a) Find the determinant of the matrix A. 

(b) Show that A is an orthogonal matrix. 

(c) Find the eigenvalues of A. 


^ For a real number 0 < 9 < 7T, we define the real 3x3 

cos# 
A — sin # 

0 


Solution. 

(a) The determinant of the matrix^ 

By the cofactor expansion corresponding to the third row, we compute 


det(v4) 


cos 9 — sin 9 0 

sin 9 cos 9 0 

0 0 1 

^ — sin 9 0 

0 - -i 

cos 9 0 

cos 2 9 + sin 2 9 

1 . 


cos 9 
sin 9 


0 

0 


+ 1 - 


cos 9 
sin# 


— sin# 
cos 9 


The last step follows from the famous trigonometry identity 













cos 2 9 + sin 2 9—1. 


Thus we have 


det(v4) = 1. 

(b) The matrix A is an orthogonal matrix 

We give two solutions for part (b). 

The first solution of (b) 

The first solution computes A T A and show that it is the identity matrix /. 
We have 



COS 9 ! 

sin 9 

o' 


cos 0 - 

-sin# 

O' 

II 

— sin # cos 6 

0 


sin# 

cos 0 

0 


0 

0 

1. 


0 

0 

1. 


cos 2 0 + sin 2 0 


0 

o' 


= 

0 


2 

COS 

9 + sin 2 

9 0 



0 



0 

1. 



'1 0 O' 






= 

0 10 

= /. 






0 0 1 







Similarly, you can check that AA T = / . Thus A is an orthogonal matrix. 

The second solution of (b) 

The second proof uses the following fact: a matrix is orthogonal if and only its column vectors form an 
orthonormal set. 

Let 


cos 9 


— sin# 


'o' 

sin# 

,A 2 = 

cos# 

5^3 = 

0 

0 


0 


_1_ 


be the column vectors of the matrix A . The length of these vectors are all 1. For example, we have 
|\A 1 11 = yj (cos 9) 2 + (sin #) 2 + 0 2 = \fl — 1. 


Similarly, we have | \A 2 11 = | \A 3 11 = 1. 

The dot (inner) product of A 1 and A 2 is 

A 1 • A 2 — cos 9 • (— sin 9) + sin 9 • cos 9 + 0-0 = 0 . 


Similarly, we have A\ - A 3 — A 2 • A 3 = 0. 


















Therefore, the column vectors A i, A 2 , A 3 are orthonormal vectors. Hence by the above fact, the matrix A 
is orthogonal. 


(c) The eigenvalues of A 

We compute the characteristic polynomial p{t) — det(^4 — tl ) as follows. 


p{t) — det(v4 — tl ) = 

= ( 1-0 


cos 9 — t — sin 9 
sin 9 cos 9 — t 
0 0 1 
cos 9 — t — sin 9 
sin 9 cos 9 — t 
= (1 — ^)(cos 2 9 — 2tcos9 + t 2 + sin 2 9) 
= (1 — t)(t 2 - ( 2 cos 9)t + 1 ). 


by the third row cofactor expansion 


The eigenvalues are roots of the characteristic polynomial p{t) , hence we solve 

p(t) — (1 — t)(t 2 — (2 cos 9)t + 1) = 0. 


One solution is t — 1 . The other solutions come from the quadratic polynomial in p{t) . 
By the quadratic formula, those solutions are 

t — cos 9 ± \J cos 2 9—1 

= cos 9 zb yj — sin 2 9 
= cos 9 zb i sin 9 


since sin 9 > 0 since 0 < 9 < n . 

Therefore the eigenvalues of the matrix A are 

l,cos 0 zb z sin 6 b 

Related Question. 

The following problem treats the rotation matrix in the plane. 


Problem. 

Consider the 2x2 matrix 


A = 


cos 9 
sin 9 


— sin 9 
cos 9 


where 9 is a real number 0 < 9 < 2n . 


(a) Find the characteristic polynomial of the matrix A . 

(b) Find the eigenvalues of the matrix A . 

(c) Determine the eigenvectors corresponding to each of the eigenvalues of A . 










Let A be an n X n matrix. Assume that every vector x in M n is an eigenvector for some eigenvalue of A. Prove 
that there exists A G 1 such that A = XI, where / is the n X n identity matrix. 


Proof. 

Let us write A = (ay ) . 

Let e, be the unit vector in R” whose /-th entry is 1, and 0 elsewhere. 

Then the vector e ; - is an eigenvector corresponding to some eigenvalue X /. That is, we have 

Aei = X ,e/. 


More explicitly, this shows that 


an 


"o' 

a 2 i 




= 

A, 

Mn—li 



_ flni _ 


. 0 . 


Therefore, it follows that am = 0 if A' / and an = X ,•. 

So we have 


Ai 0 ... 0 

0 X 2 ... 0 

0 0 ... X n 


Now we consider the vector v = 


1 

1 


G R” with all entries 1. Then v is an eigenvector of an 


.1 

eigenvalue X , that is, Av = kv . 

It follows that we have 


X 

X 


— Xv — Av — 


Xi 

0 


0 

^2 


0 

0 


1 

1 


x 2 


_x _ 

As a result, we must have X 
Thus, we obtain 


A = 




. 0 

0 


Xn . 

.1. 


=.. 

52 

II 





'Ai 

0 

... O' 


' X 

0 . 

.. o' 

0 

to 

... 0 

— 

0 

X . 

.. 0 

1 - 

O • • 

• • O 

... X n _ 


.0 

0 . 

.. X . 


J 


— XI 


as required. 























Let A be a square matrix. Prove that the eigenvalues of the transpose A T are the same as the eigenvalues of A. 


Proof. 

Recall that the eigenvalues of a matrix are roots of its characteristic polynomial. 

Hence if the matrices A and A T have the same characteristic polynomial, then they have the same 
eigenvalues. 

So we show that the characteristic polynomial p A {t) — det(v4 — tl ) ofv4 is the same as the 
characteristic polynomial p A T^t) — det(v4 T — tl ) of the transpose A T . 

We have 

p A T (t) — det(v4 T — tl) 

— det(v4 T — tI T ) since/ T = / 

= det ((A — tI) T ) 

= det(v4 — tl) since det(5 T ) = det (B) for any square matrix B 

— pA ( 0 * 

Therefore we obtain p A t (jt) — p A ( t) , and we conclude that the eigenvalues ofv4 and ^4 T are the same. 
Remark: Algebraic Multiplicities of Eigenvalues 

Remark that since the characteristic polynomials of A and the transpose A T are the same, it furthermore 
yields that the algebraic multiplicities of eigenvalues of A and A T are the same. 

Let A be an n by n matrix with entries in complex numbers C. Its only eigenvalues are 1, 2, 3,4, 5, possibly 
with multiplicities. What is the rank of the matrix A + I n , where I n is the identity n by n matrix. 

(University of California, Berkeley) 


Hint. 

The problem is asking if you understand the definition of eigenvalues and basic relation between the rank 
and non-singularlity. 

Solution. 

The matrix A + I n is nonsingular, otherwise — 1 is an eigenvalue of A but by assumption it is 
impossible. 

Since the matrix A + I n is nonsingular, it has full rank. Since A + I n is n by n matrix, its rank must be 
n. 

Comment. 

The solution is very short and simple. The point is to notice that A + I n is of the familiar form A — XI n . 
If the problem asked to find the rank of A — 6 1 n , then it would have been easier to notice this. 

Related Question. 




A related question is: 


What is the nullity of A ? 


See the post “Find the nullity of the matrix A + I if eigenvalues are 1, 2, 3, 4, 5 ” for a solution. 


Let A = 
A. 


a 

c 


b 

d 


be an 2 X 2 matrix. Express the eigenvalues of A in terms of the trace and the determinant of 


Solution. 

Recall the definitions of the trace and determinant of A : 

tr (A) — a + d and det(A) — ad — be. 


The eigenvalues of A are roots of the characteristic polynomial p(t ) of A . So let us first find p{t) . 
We have 

/ x -. / > a — t b 

p(t) — det(A — tl ) = 

c d — t 

— (a — t)(d — t) — be 

— t 2 — (a + d)t + ad — be 

— t 2 — tr (A)t + det(v4). 

Using the quadratic formula, the eigenvalues of A (roots ofp(t) ) are 

tr(v4) ± ^tr^) 2 — 4det (A) 

2 * 


Let A be an n X n nilpotent matrix, that is, A m = O for some positive integer m, where O is the n X n zero 
matrix. 

Prove that A is a singular matrix and also prove that / — A, I + A are both nonsingular matrices, where / is the 
n X n identity matrix. 



We give two proofs. The first one uses only some properties of the determinant of a matrix. 
The second one uses the theory of eigenvalues. 

We summarize several facts we will use below. 


In proof 1. 

A matrix is nonsingular if and only if it is invertible. 










0 A matrix is nonsingular if and only if its determinant is nonzero. 

0 For two square matrices A , B , we have 

det(^4i?) = det(v4) det(B). 


In proof 2. 


0 


Eigenvalues of A are all scalars X such that the matrix A 


XI is nonsingular. 


Proof 1. 

We use the fact that a matrix is nonsingular if and only if its determinant is nonzero. 
Since the determinant is multiplicative, we have 

0 = det(O) = det(^ m ) = det(^) m . 


This implies that det(^4) = 0 , and hence the matrix A is singular. 

Next, we show that the matrix I — A is nonsingular. Note that we have 

I — I — A m = (j l-A)(I + A+A 2 + ---+A m ~ 1 ). 

This implies that the matrix / + A + A 2 + ■ ■ ■ + A m ~ l is the inverse matrix of I — A . 
Thus the matrix I — A is invertible, and hence it is nonsingular. 


Next, we show that the matrix I + A is nonsingular. 

Note that we have 

/ — (—A) m = (/ +A)(/-A+A 2 -+ (-l ) m - 1 A m ~ 1 ). 

Since (—A) m = (-1 ) m A m = (-l) m O = O , we obtain 

/= (.I + A)(I-A+A 2 - + (-l) m ~ 1 A m ~ 1 ) 


and this implies that / + A is invertible matrix with the inverse matrix 

(I + A)- 1 = I — A + A 2 -+ (-l ) m - 1 A m ~ 1 . 


Therefore, the matrix I + A is nonsingular. 
Proof2. 



In the second proof, we will use an eigenvalue technique. 

Let A be an eigenvalue of the matrix A and let x be the corresponding eigenvector. Namely, we have 

Ax = Ax. 


Then we have 

0 = Ox = A m x 

= A m - 1 Ax=A m - 1 (Ax ) = ••• =A m x. 


It follows that A = 0 . Thus the only eigenvalue of A is zero. 

Recall that eigenvalues A are all scalars such that A — A/ is a singular matrix. 

Since A = 0 is an eigenvalue of A , the matrix A = A — 01 is singular. 

Since 1 is not an eigenvalue, we see that A — I is nonsingular, and hence I — A — —(A — I) is also 
nonsingular. 

Similarly, since — 1 is not an eigenvalue, we observe that / + A — A + / is nonsingular. 

Related Question. 

A natural source of nilpotent matrices is the differentiation linear transformation of vector space of 
polynomials. 

Check out the post “Differentiating Linear Transformation is Nilpotent” for problems and solutions about 
the nilpotency of the differentiation linear transformation. 

Related Question. 

Here is another problem about a nilpotent matrix. 


Problem. 

Suppose that A is an n X n nilpotent matrix and B is an n X n invertible matrix. 

Is the matrix B — A invertible? If so, give a proof. Otherwise, give a counterexample. 


Determine whether each of the following statements is True or False. 

(a) If A and B are n X n matrices, and P is an invertible n X n matrix such that A = PBP 1 , then 
det(A) = det (B). 

(b) If the characteristic polynomial of an n X n matrix A is p( A) = (A — l) n +2, then A is invertible. 

(c) If A 2 is an invertible n X n matrix, then A 3 is also invertible. 

(d) If A is a 3 x 3 matrix such that det (A) = 7, then det(2A T A -1 ) = 2. 

(e) If v is an eigenvector of an n X n matrix A with corresponding eigenvalue Ai, and if w is an eigenvector of 
A with corresponding eigenvalue A 2 , then v + w is an eigenvector of A with corresponding eigenvalue Ai + A 2 


(Stanford University, Linear Algebra Exam Problem) 

Solution. 


We use the following properties of the determinant. 


o For n X n matrices A, B, we have 

det(AB) = det(^4) det(P). 

If A is invertible, we have 

det(A _1 ) = det( J 4) _1 . 


Also recall that a matrix A is invertible if and only if det(^4) ^ 0. 

(a) True or False: If A = PBP -1 , then det(A) = det(-B). 

True. We have 

det(A) = det (PBP l ) 

= det(P) det(P) det(P ) -1 by properties 1, 2 
= det(P) det(P ) -1 det(P) since determinants are just numbers 
= det ( B). 

(b) Tme or False: Ifp(A) = (A — l) n + 2, then A is invertible. 

Tme. Note that we have p(0) = (—l) n +2^0. 

Thus 0 is not an eigenvalue of A. 

Recall that a matrix is invertible if and only if it does not have 0 as an eigenvalue. 

Thus, A is invertible. 

(c) Tme or False: If A 2 is an invertible n X n matrix, then A 3 is also invertible. 

Tme. If A 2 is invertible, then we have det(^4 2 ) 7 ^ 0. 

Then by property 1, we have 

det(^4 ) 2 = det(A 2 ) 7 ^ 0 , 

and hence det (A) 7 ^ 0. 

It follows from property 1 that we have 

det (A 3 ) = det(A ) 3 / 0 . 


Thus the matrix A 3 is invertible. 

(d) Tme or False: If A is a 3 X 3 matrix such that det (A) = 7, then det (2A T A ~ l ) 

False. Recall that we have det(A T ) = det (A). 

Together with properties 1, 2, we have 


det(2^ T A“ 1 ) = det(2/)det(^ T )det(A“ 1 ) 
= det(27) det(j4) det( J 4) _1 
= det(2 1) = 8. 


Here I is the 3 X 3 identity matrix. 


Remark that in general for an n X n matrix and a scalar r, we have 

det(r^l) = r n det(A). 

(e) True or False: v + w is an eigenvector of A with corresponding eigenvalue Ai + A 2 . 


False. For example, let A be a 2 X 2 identity matrix. 
The only eigenvalue of A is 1. 


Note that v = 


and w = 


are both eigenvector corresponding to the eigenvalue 


Ai = A 2 = 1. 

However, Ai + A 2 = 2 is not an eigenvalue of A, and thus v + w cannot be an eigenvector 
corresponding to Ai + A 2 — 2. 

Note that the set of eigenvectors corresponding to an eigenvalue A together with the zero vector form 
a vector space, called the eigenspace of A. Thus, if v and w are both eigenvectors of A, then the sum 
v + w is also an eigenvector of A or the zero vector. 


^ Let A = (a,ij) be an n X n matrix. We say that A = (dij) is a right stochastic matrix if each entry dij is 
nonnegative and the sum of the entries of each row is 1. That is, we have 

d{j ^ 0 and an + di 2 + • • • + d{ n = 1 

for 1 < i, j < n. Let A — (dij) be an n X n right stochastic matrix. Then show the following statements. 

(a) The stochastic matrix A has an eigenvalue 1. 

(b) The absolute value of any eigenvalue of the stochastic matrix A is less than or equal to 1. 


Proof. 

(a) The stochastic matrix A has an eigenvalue 1. 
We compute that 


~a n 

an . 

. . dif 2 


"I" 


011 + 012 + * 

• * + 01/2 


T" 

021 

a 2 2 • ■ 

■ • 0 2 n 


l 

= 

021 T" 022 T - * 

* • + 02/2 

= 1 • 

l 

_ Clnl 

On 2 • ■ 

• • o nn _ 


_l_ 


_ a n 1 + a n 2 • • 

+ 0/2/2 


.1. 


Here the second equality follows from the definition of a right stochastic matrix. 
(Each row sums up to 1.) 

















1 


This computation shows that 1 is an eigenvector of A and 


is an eigenvector corresponding to the 


1 


eigenvalue 1. 

(b) The absolute value of any eigenvalue of the stochastic matrix A is less than or equal to 1. 

Let X be an eigenvalue of the stochastic matrix A and let v be a corresponding eigenvector. 

That is, we have 


Ay — Xy. 

Comparing the z-th row of the both sides, we obtain 

ciii Vi + a i2 v 2 H-h am v„ = av, (*) 

for i — 1,..., n . 

Let 


\n\ = max{|vi|,|v 2 |,...,|v„|}, 


namely is the entry of v that has the maximal absolute value. 


Note that | | > 0 since otherwise we have v = 0 and this contradicts that an eigenvector is a nonzero 

vector. 

Then from (*) with i — k, we have 

\M • \vk\ = \akivi + a k 2V 2 H-b a kn v n | 

< Oki l v i I + a 2 |v 2 1 + • • • + ah, | v n | (by the triangle inequality and > 

< a k i \v k | + a 2 \v k | 4- V a kn \v k \ (since \v k \ is maximal) 

= ( a ki + a k 2 + • • • + a kn ) | v k | = \v k |. 

Since \v k \ > 0, it follows that 

A<1 


as required. 


Remark. 

A stochastic matrix is also called probability matrix, transition matrix, substitution matrix, or Markov 
matrix. 




Suppose that A and /z are two distinct eigenvalues of a square matrix A and let x and y be eigenvectors 
corresponding to A and fi, respectively. If a and b are nonzero numbers, then prove that ax + by is not an 
eigenvector of A (corresponding to any eigenvalue of A). 


Hint. 

We use the following fact in the proof. 

Fact: Two eigenvectors corresponding to distinct eigenvalues are linearly independent. 





Proof. 

Seeking a contradiction, we assume that ax + by is an eigenvector corresponding to an eigenvalue 
Thus we have 

A (ax + by) — <f(ax + by). (* 


We calculate the left hand side of this equality as follows. 

We have 

A (ax + by) — aAx + bAy 
= aAx + bfiy 


since Ax — Ax, Ay = juy by defintion. 
Therefore, from (*) we obtain 


a(X - C)x + b(ju - C) y = 0. 


Recall that eigenvectors corresponding to distinct eigenvalues are linearly independent. Thus x and y are 
linearly independent. 

Thus, the coefficients of the above linear combinations must be zero: 

a(A — 0 — 0 and b(ju — Q = 0. 

Since a ^ 0, b 0, this implies that we have 


x = C = n, 

and this is a contradiction because A and // are supposed to be distinct. 
Hence, ax + by cannot be an eigenvector of any eigenvalue of A . 


^ Suppose that A is an n X n singular matrix. Prove that for sufficiently small e > 0, the matrix A — el is 
nonsingular, where / is the n X n identity matrix. 


Hint. 

Consider the characteristic polynomial p(t) of the matrix A . 

Note that the eigenvalues of A are the roots of p(t ). 

Thus if e is not an eigenvalue, then p(c) ^ 0. 

Proof. 

Let p(t) — Y\" = (A i — t) be the characteristic polynomial for A , where A / are eigenvalues of A . 




Let A i 0 be the nonzero eigenvalue of A of the smallest absolute value. 

That is | A i Q | < \A i | for any nonzero eigenvalue A / . 

Then for any 0 < e < \A / 0 1, we have det(^4 — el) — p(e) ^ 0 , otherwise e would be an eigenvalue 
of A but it is impossible because of the minimality of A ; 0 . 

Therefore the matrix A — el is nonsingular for all 0 < e < | A io \ . 

Let A and B be n X n matrices. Prove that the characteristic polynomials for the matrices AB and BA are the 
same. 


Hint. 

o Consider the case when the matrix A is invertible. 


0 Even ifv4 is not invertible, note thatv4 — el is invertible matrix for sufficiently small e. 
(See Problem Perturbation of a singular matrix is nonsingular for a proof of this fact.) 


^ Take the limit e 0. 


Proof. 

We want to show that | AB — AI\ — \ BA — AI | , where A is an unknown and / is the n X n identity 

matrix. 


First let us consider the case when the matrix A is invertible. 
So suppose that A is invertible. 

Then we have 


|AB - U\ = |A ~ 1 {AB - H)A\ = |BA - XI\. 

Here we use the fact that \A\\A 1 = \AA 1 = |/| = 1 . 

Thus when A is invertible, the claim is proved. 


Now we consider the general case. 

Whether or not A is invertible, the matrix A — el is invertible for sufficiently small e. 

(More precisely, if e is smaller than absolute values of all nonzero eigenvalues of A . then A — el is 
invertible. See Problem Perturbation of a singular matrix is nonsingular for a proof.) 


Then by the previous case, we have 






|(A - cI)B - U\ = | B(A - cl) - U\. 

Taking the limit c -A 0 , we obtain 

\AB-U\ = \BA-U\. 

Comment. 

When you study hard linear algebra, you might have forgotten about calculus. 

This problem suggests that for some problems in linear algebra, techniques from calculus (like limits) 
might help to solve them. 


O Let A be an n X n real matrix. Prove that if A is an eigenvalue of A, then its complex conjugate A is also an 
eigenvalue of A. 

Proof 1. 

Let x be an eigenvector corresponding to the eigenvalue A. Then we have 

Ax = Ax. 


Taking the conjugate of both sides, we have 

Ax = Ax. 


Since A is a real matrix, it yields that 

Ax = Ax. (*) 

Note that x is a nonzero vector as it is an eigenvector. Then the complex conjugate x is a nonzero 
vector as well. 

Thus the equality (*) implies that the complex conjugate A is an eigenvalue of A with corresponding 
eigenvector x. 

Proof 2. 

Let p(t) be the characteristic polynomial of A. 

Recall that the roots of the characteristic polynomial p(t ) are the eigenvalues of A. 

Thus, we have 


p(X) = o. 

As A is a real matrix, the characteristic polynomial p(t) has real coefficients. 
It follows that 


p(t) =p(t). 


The previous two identities yield that 

P( A) = p(A) = 0 = 0, 

and the complex conjugate A is a root of p(t ), and hence A is an eigenvalue of A. 





o Let A be a square matrix. Prove that the eigenvalues of the transpose A T are the same as the eigenvalues of A. 


Proof. 

Recall that the eigenvalues of a matrix are roots of its characteristic polynomial. 

Hence if the matrices A and A T have the same characteristic polynomial, then they have the same 
eigenvalues. 

So we show that the characteristic polynomial Pa{ t) — det(^4 — tl ) of A is the same as the 
characteristic polynomial p A v[t) = det(^4 T — tl ) of the transpose A T . 

We have 

p A T (t) — det(v4 T — tl) 

— det(v4 T — tI T ) since I T — I 

— det ((A — tI ) T ) 

= det (A — tl) since det(5 T ) = det (B) for any square matrix B 

= Pa (0- 

Therefore we obtain p A ^[t) — Pa ( t) , and we conclude that the eigenvalues ofv4 and ^4 T are the same. 
Remark: Algebraic Multiplicities of Eigenvalues 

Remark that since the characteristic polynomials of A and the transpose A T are the same, it furthermore 

rp 

yields that the algebraic multiplicities of eigenvalues of A and A are the same. 

Let A be an n X n matrix. Its only eigenvalues are 1, 2, 3,4, 5, possibly with multiplicities. What is the nullity 
of the matrix A + I n , where I n is the n X n identity matrix? 

{The Ohio State University) 


Solution. 

Recall that X is an eigenvalue of the matrix A if and only if the matrix A — XI is singular. 

Thus, A + / is nonsingular, otherwise — 1 is an eigenvalue of A . 

Since A + I is nonsingular, the we have the null space J\f (A + I) = {0} , and hence the nullity of 
A + I is zero. 

Suppose that a real symmetric matrix A has two distinct eigenvalues a and /3. Show that any eigenvector 
corresponding to a is orthogonal to any eigenvector corresponding to /3. 

{Nagoya University) 


Hint. 

T* r\ 

Two vectors u and v are orthogonal if their inner (dot) product u • v := u v = 0 . 
T 

Here u is the transpose of u. 





A fact that we will use below is that for matrices A and B , we have ( AB ) T = B T A T . 

Proof. 

Let u, v be eigenvectors corresponding to a,/? , respectively. 

Namely we have 

Am — au and Ay — fiv. (*) 

To prove that u and v are orthogonal, we show that the inner product u • v = 0 . 

Keeping this in mind, we compute 

a(u • v) = (au) • v 
Au ■ y = (Au) t v 

= u t A t v (This follows from the fact mentioned in the hint above) 

= u T dv (since A is symmetric.) 

= u T /?v = /?(u T v) = u • v). 


Therefore we obtain 


a(u-v) =yS(u-v), 


and thus 


(a — /?)( u • v) = 0. 

Since a and /? are distinct, a — J3 ^ 0 . 

Hence we must have 


u - v = 0, 


and the eigenvectors u, v are orthogonal. 




Show that eigenvalues of a Hermitian matrix A are real numbers. 


Proof 1. 

Let A be an arbitrary eigenvalue of a Hermitian matrix A and let x be an eigenvector corresponding to the 
eigenvalue A . 

Then we have 


Ax = Ax. (*) 

Multiplying by x from the left, we obtain 

x T (Ax ) = x T (/lx) 




We also have 

x T (Ax) = (Axfx = x T A T x. 

The first equality follows because the dot product u • v of vectors u, v is commutative. 
That is, we have 

T T 

u • v = u v — v u = v • u. 

We applied this fact with u = x and v = Ax . 




Thus we obtain 

x T A T x = 2||x||. 

Taking the complex conjugate of this equality, we have 

T -T 

x A x = A ||x||. 

(Note that x = x. Also | |x| | = | |x| | because | |x| | is a real number.) 

-T 

Since the matrix A is Hermitian, we have A —A . 


This yields that 


X||x|| =’x T .4x 

S x T Xx 
= X||x||. 


Recall that x is an eigenvector, hence x is not the zero vector and the length | |x| | 0 . 

Therefore, we divide by the length | |x| | and get 

A = x. 

It follows from this that the eigenvalue A is a real number. 

Since A is an arbitrary eigenvalue of A , we conclude that all the eigenvalues of the Hermitian matrix A are 
real numbers. 

Proof 2. 

Let A be an arbitrary eigenvalue of a Hermitian matrix A and let x be an eigenvector corresponding to the 
eigenvalue A . 

Then we have 

Ax — Ax. (*) 

Multiplying by x from the left, we obtain 


Now we take the conjugate transpose of both sides and get 


x T A T x = X I 


x 


-T 

Since A is a Hermitian matrix, we have A —A . 


Then the left hand side becomes 


x t / T x = x t Ax 

(*) _Tn 
= X AX 


= / X . 


(*** 




Therefore comparing (***) and (****) we obtain 

A||x|| = X||x||. 

Since x is an eigenvector, it is not the zero vector and the length | |x| | 0 . 

Dividing by the length | |x| |, we obtain X — X and this implies that X is a real number. 

Since X is an arbitrary eigenvalue of A , we conclude that every eigenvalue of the Hermitian matrix A is a 
real number. 

Corollary 

Every real symmetric matrix is Hermitian. Thus, as a corollary of the problem we obtain the following fact: 


Eigenvalues of a real symmetric matrix are real. 


Related Question. 


Problem. Let A be an n X n real symmetric matrix. 

Prove that there exists an eigenvalue X of A such that for any vector vGK\we have the inequality 

v • Ay < A||v|| 2 . 


Note that the inequality makes sense because eigenvalues ofv4 are real by Corollary. 

For a proof of this problem, see the post “Inequality about Eigenvalue of a Real Symmetric Matrix 46 . 


O Prove that the matrix 


A = 


1 

1.00001 

1 


1.00001 

1 

1.00001 


1 

1.00001 

1 


has one positive eigenvalue and one negative eigenvalue. 
(University of California, Berkeley) 


Solution. 

Let us put a = 1.00001 . We compute the characteristic polynomial det (^4 — tl ) of the given matrix A 
as follows. 

We have 


det (^4 — tl) 


1 

- 1 

a 

1 


a 

1 -t 

a 


1 

a 

1 - 

(1 

-0 

1 -t 

a 

a 

1 - 



1 -t 
a 


by the cofactor expansion corresponding to the first row. 

Simplifying this, we obtain 

det (^4 — tl) = — t(t 2 — 3 t + 2 — 2 a 2 ). 


The eigenvalues of A are roots of this characteristic polynomial. 

Hence 0 is an eigenvalue of A . Let A i and A 2 be other two eigenvalues of A . 

Then we have 


det(^4 — tl) — — t(f — A\)(f — A 2 ) — — t(f 2 — (A r A 2 ) H" A iA 2 )• 
Therefore we have 


A 1 A 2 — 2 — 2a 2 . 

Since 2 — 2a 2 = 2(1 — a 2 ) < 0 as a — 1 .0001 > 1 , the product A\A 2 is negative, and we 
conclude that one of them is positive and the other is negative. 

(Note that if the constant c term of a quadratic polynomial x 2 + bx + c is negative, then the roots of the 
polynomial are real and one is negative and the other is positive.) 

In summary, the eigenvalues of the matrix A are 0 and one positive eigenvalue and one negative 
eigenvalue. 

Suppose that A and fi are two distinct eigenvalues of a square matrix A and let x and y be eigenvectors 
corresponding to A and n, respectively. If a and b are nonzero numbers, then prove that ax + by is not an 
eigenvector of A (corresponding to any eigenvalue of A 













Hint. 

We use the following fact in the proof. 

Fact: Two eigenvectors corresponding to distinct eigenvalues are linearly independent. 

Proof. 

Seeking a contradiction, we assume that ax + by is an eigenvector corresponding to an eigenvalue 
Thus we have 

A (ax + by) — C(ax + by). (*) 


We calculate the left hand side of this equality as follows. 

We have 

A (ax + by) — aAx + bAy 
= aAx + bfiy 


since Ax — Ax, Ay — juy by defmtion. 
Therefore, from (*) we obtain 


a(A — C)x + b(/u — C)y — 0. 


Recall that eigenvectors corresponding to distinct eigenvalues are linearly independent. Thus x and y are 
linearly independent. 

Thus, the coefficients of the above linear combinations must be zero: 

a(A — C) = 0 and b(ju — Q — 0. 

Since a ^ 0, b 0, this implies that we have 

^ = c= 

and this is a contradiction because A and // are supposed to be distinct. 

Hence, ax + by cannot be an eigenvector of any eigenvalue of A . 


O 


Consider a polynomial 


p{x) = x n + a ra _ iX n 1 H-1- aiX + a 0 , 


where are real numbers. Define the matrix 


"0 

0 

... 0 

— ao 

1 

0 

... 0 

-CLi 

0 

1 

... 0 

—a 2 

.0 

0 

... 1 

^n— 1 _ 






Then prove that the characteristic polynomial det {xl — A) of A is the polynomial p{x) . The matrix is called 
the companion matrix of the polynomial p(x ). 


G> 


Let A be an n X n matrix. Suppose that all the eigenvalues A of A are real and satisfy A < 1. Then show that 
the determinant det(/ — ^4) >0, where / is the n X n identity matrix. 


□ 


Consider the 2x2 matrix 


A = 


COS# 

sin# 


— sin# 
cos# 


where # is a real number 0 < # < 27T. 

(a) Find the characteristic polynomial of the matrix A. 

(b) Find the eigenvalues of the matrix A. 

(c) Determine the eigenvectors corresponding to each of the eigenvalues of A. 


Proof. 

(a) Find the characteristic polynomial of the matrix A . 

The characteristic polynomial p{t) of A is computed as follows. 

Let / be the 2x2 identity matrix. 

We have 

p(t) — det(v4 — tl) 

cos 6 — t — sin 9 
sin 9 cos 9 — t 

= (cos 9 — t ) 2 + sin 2 9 (*) 

= t 2 - (2 cos 9)t + cos 2 9 + sin 2 9 
— t 2 — (2 cos 9)t + 1 


by the trigonometry identity cos 2 9 + sin 2 9=1. 

Hence the characteristic polynomial of A is 

p(t) — t 2 — (2 cos 9)t + 1. 

(b) Find the eigenvalues of the matrix^ . 

The eigenvalues ofv4 are roots of the characteristic polynomial p{t ). 
So let us solve 

p{t) — t 2 — (2 cos 9)t + 1 = 0. 

By the quadratic formula, we have 









Hence eigenvalues of A are 


2 cos 9 ± yj(2 cos #) 2 — 4 
= 2 

= cos # ± -\/cos 2 0—1 
= cos 9 ± a/— sin 2 9 
= cos# ± / sin# = e ±ld . 


cos # ± / sin # = e ±l °. 


Alternatively, we could have used the equation (*). 
Then we have 


p{t) — (cos# — t) 2 + sin 2 9 = 0 
(cos 9 — t) 2 = — sin 2 # 
cos# — t = it/sin# 


and thus, we obtain the eigenvalues 

cos # ± / sin # = e ±lS . 

(c) Determine the eigenvectors 

Let us first find the eigenvectors corresponding to the eigenvalue X = cos # + / sin #. 
We have 


A — XI = 


—/sin# 

sin# 


— sin# 
—/sin# 


If # = 0, jc , then sin 9=0 and we have 


A 



0 

0 


and thus each nonzero vector of R 2 is an eigenvector. 


lfX^O,7V , then sin# ^ 0. 

Thus we have by elementary row operations 


—/sin 9 

— sin 9 

-!—Ri 

sin # 

1 

—i 

R 2 —R 1 

1 -/1 

sin 9 

—i sind_ 

-> 

.1 

—i _ 

-> 

- 1 

0 

0 


sin # 


It follows that the eigenvectors for A are 




















for any nonzero scalar t . 

The other eigenvalue is the conjugate A of A . 

Since A is a real matrix, the eigenvectors of A are the conjugate of those of A . 
Hence the eigenvectors corresponding to A is 

—i 

Lr 

for any nonzero scalar t . 


In summary, when 9 — 0, n , the eigenvalues are 1, —1 , respectively, and every nonzero vector of R 2 is 
an eigenvector. 

If 9 0, 7T, then the eigenvectors corresponding to the eigenvalue cos 9 + i sin 9 are 

i 

ir 

for any nonzero scalar t . 

The eigenvectors corresponding to the eigenvalue cos 9 — i sin 9 are 

—i 

Lr 

for any nonzero scalar t. 

Related Question. 

For a similar proble, try the following. 


Problem. 

For a real number 0 < 0 < 7T, we define the real 3x3 matrix A by 


A = 


cos 9 — sin 9 0 

sin 9 cos 9 0 

0 0 1 


(a) Find the determinant of the matrix A . 

(b) Show that A is an orthogonal matrix. 

(c) Find the eigenvalues of A . 


Let A be an n X n matrix and let Ai,..., A n be its eigenvalues. Show that 

(a) det(A) = nr=i Ai. 

(b) tr (A) = £? =1 A,. 












Use the definition of eigenvalues (the characteristic polynomial). 


0 Compare coefficients. 


Plan 2. 

^ Make A upper triangular matrix or in the Jordan normal/canonical form. 


0 Use the property of determinants and traces. 


Proof. [Method 1] 

(1) Recall that eigenvalues are roots of the characteristic polynomial p(X) — det(^4 — XI n ) 
It follows that we have 


det(^4 — XI n ) 


an — x 

ai2 

Cl2l 

a 

to 

to 

' 1 

N 

ttnl 

d m 2 


Ml,n 

@2 ,n 


®nn 


-X 


i=l 


Letting X — 0, we see that det (A) — ^ i 


and this completes the proof of part (a). 


(2) Compare the coefficients of X n 1 of the both sides of (*). 

The coefficient of X n ~ l of the determinant on the left side of (*) is 

( — 1)” 1 (#11 + ^22 + * * * a nn ) — ( — 1)” 1 tr (A). 

The coefficient of X n ~ l of the determinant on the right side of (*) is 

i=l 


Thus we have tr (A) = Y^-i 2 i . 

Proof. [Method 2] 


Observe that there exists ann X n invertible matrix P such that 






P~ X AP 


X i * 

o a 2 


* 

* 


0 0 ••• k n 




This is an upper triangular matrix and diagonal entries are eigenvalues. 

(If this is not familiar to you, then study a “triangularizable matrix” or “Jordan normal/canonical form”.) 


(1) Since the determinant of an upper triangular matrix is the product of diagonal entries, we have 

n 

~[kt = det (P~ l AP) = det(P _1 )det(^)det(P) 

i= 1 

= det(P) _1 det(^4) det(P) = det(^4), 


where we used the multiplicative property of the determinant. 


(2) We take the trace of both sides of (**) and get 

n 

= tr (P-'AP) = tr(^). 

1=1 


(Here for the last equality we used the property of the trace that tr (AB) — tr {BA) for any n X n 
matrices A and B .) 

Thus we obtained the result tr(v4) = 'Ym=i ^ i • 

Comment. 

The proof of (1) in the first method is simple, but that of (2) requires a bit observation, especially when we 
find the coefficient of the left-hand side. 

The proof of (2) in the second method is simpler although you need to know about the Jordan 
normal/canonical form. 

These two formulas relate the determinant and the trace, and the eigenvalue of a matrix in a very simple 
way. 


(a) A 2 x 2 matrix A satisfies tr(^4 2 ) = 5 and tr(A) = 3. 

Find det (^4). 

(b) A 2 X 2 matrix has two parallel columns and tr(^4) = 5. Find tr(^4 2 ). 

(c) A 2 X 2 matrix A has det (^4) = 5 and positive integer eigenvalues. What is the trace of Al 
(Harvard University) 


Solution. 

For (a) and (b), we give two solutions. The first one does not use the knowledge of eigenvalues, and the 
second one uses eigenvalues. 





Solution 1 of (a) A 2 X 2 matrix^ satisfies tr (A 2 ) = 5 and tr (A) = 3. Find det (^4). 
Let 


A 


a b 
c d 


Then we have 


A 2 - 

a b 


a b 


a 2 + be ab + bd 

/I - 

_c d_ 


_c d_ 


_ac + cd be + d 2 _ 


Since tr(^4 2 ) = 5 and tr(^4) = 3 , we obtain 

5 = tr(v4 2 ) = ( a 2 + be) + (be + d 2 ) — a 2 + 2 be + d 2 and 
3 = tr(^4) = a + d. 


We find the determinant det(v4) — ad — be as follows. 
We have 


det(v4) — ad — be 


-((a + d) 2 


= |(3 2 — 5) = 2. 


(a 2 + 2 be + d 2 )) 


Thus, we obtain det(^4) = 2 . 
Solution 2 of (a) 

Let A i and A 2 be eigenvalues of A . 
Then we have 


3 = tr(^4) = A 1 + A 2 and 
5 = tr(^4 2 ) = A\ + A\. 


Here we used two facts. 

The first one is that the trace of a matrix is the sum of all eigenvalues of the matrix. 

The second one is that A 2 is an eigenvalue of A 2 if A is an eigenvalue of A , and these are all the 
eigenvalues of A 2 . 

Since the determinant of A is the product of eigenvalues of A , we have 

det(^4) — A\A 2 

= 2'(^ 1 +^2) 2 - (^1 +^2)) 

= f(3 2 -5) 

= 2 . 


Hence we have det (A ) = 2 . 










Solution 1 of (b) A 2 X 2 matrix has two parallel columns and tic (A) = 5. Find tr (A 2 ). 
Since two columns are parallel, we can write A as 

A = 


a ra 
c rc 


Then we have 


5 = tr(^4) — a + rc. 


We use the formula in Solution 1 of (a) for tr(^4 2 ) with b — ra and d = rc , and we compute 

tr(^4 2 ) — a 2 + 2 (ra) + (rc) 2 
— (a + rc) 2 
= 5 2 = 25. 


Thus, we find tr(v4 2 ) = 25. 

Solution 2 of (b) 

Since two columns are parallel, the matrix A is singular. Hence A has an eigenvalue 0. 

Since the sum of all the eigenvalues is tr(^4) = 5 , we see that 0 and 5 are eigenvalues of A . 

It follows that 0 and 25 are eigenvalues of A 2 . Hence 

tr(^ 2 ) = 0 + 25 = 25. 

Solution of (c) A 2 X 2 matrix + has det(/l) = 5 and positive integer eigenvalues. What is the 
trace of ^4 ? 

The product of eigenvalues ofv4 is the determinant det(v4) = 5 . 

Since eigenvalues are positive integers, it follows that 1 and 5 are eigenvalues of A . 

It follows that 


tr(yt) = 1 + 5 = 6. 


$ 


Let n be an odd integer and let A be an n X n real matrix. Prove that the matrix A has at least one real 


eigenvalue. 


Proof 1. 

Let p{t) — det(^4 — tl) be the characteristic polynomial of the matrix A . 

It is a degree n polynomial and the coefficients are real numbers since A is a real matrix. Since n is odd, 
the leading term of p{t) is — t n . That is, we have 

p{t) — —t n + lower terms. 






(Note: if you use the alternative definition of characteristic polynomial p{t) — det {tl — A ) , then the 
leading term is t n . You can easily modify the proof for this case.) 

Therefore, as t increases the polynomial p{t) tends to — oc: 


Similarly, we have 


lim = —oc. 

t—> oo 


lim = oc. 

t —>—oo 

By the intermediate value theorem, there is — oo < 2 < oc such thatp(2) = 0 . Since a root of p{t) is 
an eigenvalue of A , we have obtained a real eigenvalue A of A. 

Proof 2. 

Let p{t) — det(v4 — tl) be the characteristic polynomial of the matrix A and write it as 

pit) = IL A ' _ 

i= 1 

where X / are distinct eigenvalues of A and n / is the algebraic multiplicity of 2 /. 

Since A is a real matrix, the coefficients ofare real numbers. So we have 

pit) =pit). 


Thus, we have 

pit) =pit)= n<T -1 Y‘ ■ 

i—1 


Replacing t by t , we obtain 

pw= n< x '-<)"'■ 

z=l 


This yields that the complex conjugate X / is an eigenvalue with algebraic multiplicity /. 
Hence each non-real eigenvalue X appears as a pair (2,2) counting multiplicity. 

Thus, there are even number of non-real eigenvalues of A . 

The number of all eigenvalues of A counting multiplicities is n . 

Since n is odd, there must be at least one real eigenvalue of A . 

Let 



a o 

ai 

••• a n _2 

^n—1 

On —1 

ao 

... O n —S 

On—2 

On—2 

1 

••• &n—4 

On—S 

0,2 

as 

ao 

ai 

- Ol 

02 

••• o n — i 

Oq _ 


be a complex n X n matrix. Such a matrix is called circulant matrix. Prove that the determinant of the circulant 
matrix A is given by 

n— 1 

det(A) = + ai( k + a 2 ( 2k + • • • + &n-iC^ n 1 ^)? 

k =0 

where £ = e 2?r */ n is a primitive n-th root of unity. 


Proof. 

Let co be any n -th root of unity. 

Consider the vector 

1 

co 

V = CO 2 


of- 


We show that the vector v is an eigenvector of A . 
We compute 


Av = 


a 0 

a\ 

• • • Cl n —2 

Ctfi —1 

l 

ao 

CO 

1 

R 

^3 

an -2 

a n -2 

@n —1 

1 . . 

R 

CO 

1 . . 

R 

a 2 

as 

a 0 

ai 

CL\ 

a 2 

a n ~i 

ao 


The first entry of the vector A v is 

do + a\CO + a 2 <j} + 
We define A to be this number. 


a n - 2 C 0 n 2 + a n -ico n 1 A 


co 


co 


co n 


n —1 


-1 


The second entry is 


a n -i + a o co + • • • + a n -zco n 2 + a n -2C0 n 

— ( CL n —\dJ 1 + CIq A * * * Cl n —^CO n ^ -|- Cl n _2(A) 1 )CO 

— + • • * + a n -%co + a n -2CO + a n -\co )co 

— Aco 


Similarly the /-th entry of the vector Av is 


















a n-i+ 1 + a n-i+ 2 CO + • • • + CL n -i C0 H 

— (fln-i+l CO n + a n -i +2 CO n Z+2 + * * • + a n -i co n i )&/ 

= /W _1 . 


Therefore we obtain 


A 




v4v = 


Acu 2 


= Av. 


Xaf- 1 


Since v is a nonzero vector, it follows that k is an eigenvalue of A and v is an eigenvector corresponding 
to X. 


The above argument holds for any n -th root of unity co. 

We take co — where k runs from 0 to n — 1. 

It follows that the vector 

1 

f* 

r2k 

Vk := 

£k(n— 1 ) 

is eigenvector corresponding to the eigenvalue 

hk := do -\- Cl I + #2<f 2 ^ + * * ' a n-2^ n ^ + a n -i^ n ^ 
for each & =0, — 1 . 


We claim that the vectors v £ are linearly independent. 

To see this, form a matrix whose column vectors are these vectors. That is, we consider 


"1 

1 

l 

1 

1 

1 

c 

C 2 

£H— 2 

1 

1 

c 2 

c 4 

to 

's' 

1 

to 

to 

's' 

1 

1 

c 3 

c 6 

£"3(b-2) 

£-3(/i—1) 

.1 

1 


< r(«-l)(«-2) 

£-(n-l)(n-l) 


This is the Vandermonde matrix and its determinant is 











det(fi) = l[(C J - n/0. 

i<j 

Thus, the matrix B is nonsingular, hence its column vectors are linearly independent. It follows that A k , 
k — 0, 1, . . ., fl are all the eigenvalues of A . Since the determinant is the product of all the eigenvalues 
of A , we have 


det(^4) 


n —1 


IP* 


n —1 


^(ao + ai£ k + ci2C 2k + * * 

k =o 


• + a n 


rk(n—l) 

lC 


), 


as required. This completes the proof. 


0 Let A be an n X n real matrix. Prove the followings: 

(a) The matrix ^4^4 T is a symmetric matrix. 

(b) The set of eigenvalues of A and the set of eigenvalues of A T are equal. 

(c) The matrix AA T is non-negative definite. 

(An n X n matrix B is called non-negative definite if for any n dimensional vector x, we have x T 5x > 0.) 

(d) All the eigenvalues of ^4^4 T is non-negative. 


Facts. 

Before the proofs, we first review several basic properties of the transpose of a matrix. 


For any matrices A and B so that the product AB is defined, we have ( AB ) T = B T A T 


^ We have (A T ) T — A for any matrix A . 

Also recall that the eigenvalues of a matrix A are the solutions of the characteristic polynomial 
Pa i x ) — det(v4 — xl ) . 

Proof. 

(a) The matrix AA T is a symmetric matrix. 

We compute (AA T ) T = (A T ) T A T = AA T and thus AA T is a symmetric matrix. 

(We used the Fact 1 and 2.) 

(b) The set of eigenvalues of A and the set of eigenvalues of A T are equal. 

We show that the characteristic polynomials of A and A T are the same, hence they have exactly same 
eigenvalues. 

Let Pa (x) and p A t (x) be the characteristic polynomials of A and A T , respectively. Then we have 
Pa ( x ) — det (A — xl ) = det(^4 — xI) T — det(^4 T — xl) = p A t (x). 


The first and last equalities are the definition of the characteristic polynomial. 

The second equality is true because in general we have det(5) = det(5 T ) for a square matrix B . This 
completes the proof of (b). 

(c) The matrix AA T is non-negative definite. 

Let x be an n dimensional vector. Then we have 

x T AA T x = (A T x) T (A T x) = \ \A T x\\ > 0, 


since a norm (length) of a vector is always non-negative. 

Thus A A T is non-negative definite. 

(d) All the eigenvalues of AA T is non-negative. 

Let X be an eigenvalue of AA T and let x be an eigenvector corresponding to the eigenvalue X . 
Then we compute 

x T AA T x = x T Xx — A||x||. 


Here the first equality follows from the definitions of the eigenvalue X and eigenvector x. In part (c), we 
proved that AA T is non-negative definite, hence we have X | |x| | > 0 . 

Therefore X > 0 and this completes the proof of (d). 

Related Question. 


Problem. A real symmetric n X n matrix A is called positive definite if 

x T Ax > 0 


for all nonzero vectors x in R* 2 . 

(a) Prove that the eigenvalues of a real symmetric positive-definite matrix A are all positive. 

(b) Prove that if eigenvalues of a real symmetric matrix A are all positive, then A is positive-definite. 


Let A be an n X n matrix. Suppose that y is a nonzero row vector such that y A = y. (Here a row vector means 
a 1 X n matrix.) Prove that there is a nonzero column vector x such that Ax — x. (Here a column vector means 
an n X 1 matrix.) 


Proof 1 .(Without the theory of eigenvalues) 


Let / be the n X n identity matrix. Then we have 

0i xn = yA - y = y(A - I), 



where 0 \ xn is the row zero vector. 

Taking the transpose, we have 

0»xi = 0? x „ = (y(A -/)) T 

= (A ~l) T y T - 

Since the vector y is nonzero, the transpose y T is a nonzero column vector. 
Thus, the above equality yields that the matrix (A — /) T is singular. 

It follows that the matrix A — I is singular as well. 

Hence there exists a nonzero column vector x such that 

(A -I)x = 0„xi, 


and consequently we have 


Ax = x 


for a nonzero column vector x. 


Proof 2. (Using the theory of eigenvalues) 

Taking the conjugate of the both sides of the identity y A — y , we obtain 

A T y T =y T . 

T 1 

Since y is a nonzero row vector, y is a nonzero column vector. 

It follows that 1 is an eigenvalue of the matrix A T and y T is a corresponding eigenvector. 

Since the matrices A and A T has the same eigenvalues , we deduce that the matrix A has an eigenvalue 1. 
(See part (b) of the post “Transpose of a matrix and eigenvalues and related questions. “.) 

Let x be an eigenvector corresponding to the eigenvalue 1 (by definition x is nonzero). Then we have 


Ax = x 


as required. 


0 (a) Let A be a real orthogonal n X n matrix. Prove that the length (magnitude) of each eigenvalue of A is 1. 

(b) Let A be a real orthogonal 3x3 matrix and suppose that the determinant of A is 1. Then prove that A has 1 
as an eigenvalue. 


Proof. 

(a) Prove that the length (magnitude) of each eigenvalue of A is 1 
Let A be a real orthogonal n X n matrix. 

Let X be an eigenvalue of A and let v be a corresponding eigenvector. 
Then we have 




Ay — Ay. 


It follows from this we have 


IMvf 



The left hand side becomes 




= (Av) (Av) 
= v T A T Av 


by definition of the length 
because A is real 

because A T A — I as A is orthogonal 
by definition of the length. 


It follows that we obtain 



W 2 



Since v is an eigenvector, it is non-zero, and hence || v|| ^ 0 . 
Canceling 11 v 11 , we have 

W 2 = i. 


Since the length is non-negative, we obtain 


W 1, 


as required. 

(b) Prove that A has 1 as an eigenvalue. 

Letv4 be a real orthogonal 3x3 matrix with det(v4) = 1. 

Let us consider the characteristic polynomial p{t) = det(yt — tl ) of A . 

The roots of p{t) are eigenvalues of A . 

Since A is a real 3x3 matrix, the degree of the polynomial p{t) is 3 and the coefficients are real. 
Thus, there are two cases to consider: 

^ there are three real eigenvalues a,/?, y , and 


there is one real eigenvalue a and a complex conjugate pair /?,/? of eigenvalues. 


Let us first deal with case 1. 

By part (a), the lengths of eigenvalues a,/?, y are 1. Since they are real numbers, we have 

a = ±1,/? = ±l,y = ±1. 

Recall that the product of all eigenvalues of A is the determinant of A . 

(For a proof, see the post “Determinant/trace and eigenvalues of a matrix 46 .) 

Thus we have 




afiy — det(v4) = 1. 


Thus, at least one of a, /?, y is 1. 


Next, we consider case 2. Again the lengths of eigenvalues a,/?,/? are 1. 
Then we have 


1 = det(v4) = afifi 

— a\fi\ 2 — a. 


Therefore, in either case, we see that A has 1 as an eigenvalue. 


0 Let A and B be square matrices such that they commute each other: AB — BA. 

Assume that A — B is a nilpotent matrix. Then prove that the eigenvalues of A and B are the same. 


Proof. 


Let N := A — B .By assumption, the matrix N is nilpotent. 

This means that there exists a positive integer n such that N n is the zero matrix O. 

Let A be an eigenvalue of B and let v be an eigenvector corresponding to A . That is, we have By — Ay 
andv ^ 0. 

We prove that A is also an eigenvalue of A . 

Note that since A and B commute each other, it follows that the matrices N and B — AI commute each 
other as well. 

Then we compute 


(A - AI) n — (N + B — AI) n 



where the second equality follows by the binomial expansion. (Note that the binomial expansion is true for 
matrices commuting each other.) 

Then we have 



since ( B — AI)y — 0 and N n = O . 


This implies that there exists an integer k,0<k<n — 1 such that 


u (A — AI) k Y y 0 and (A — AI) k+1 v — 0. 


It yields that (A — AI)u = 0 and u 7 ^ 0 , or equivalently Am — Am . 

Hence A is an eigenvalue of A . 

This proves that each eigenvalue of B is an eigenvalue of A . 

Note that if A — B is nilpotent, then B — A is also nilpotent. 

Thus, switching the roles of A and B , we also see that each eigenvalue of A is an eigenvalue of B . 
Therefore, the eigenvalues of A and B are the same. 

Let A be an n X n matrix. Suppose that A has real eigenvalues Ai, A 2 , • • •, A n with corresponding eigenvectors 
ui,u 2 , . ., u n . Furthermore, suppose that |Ai | > |A 2 | > * * * > |A n |. Let 

x 0 ClUi + C 2 u 2 H-b c n u n 

for some real numbers Ci, c 2 ,..., c n and Ci 7 ^ 0. Define x& + i = Ax.^ for k=0, 1, 2,\dots and let 


Pk 


Xfc • X fe+ i _ xT Xfe+ i 

Xfc-Xfc 


Prove that lim^oo f3 k = Ai. 


Proof. 


We have 



by applying the definition of x k successively. 
Then we have 


X* = A k x 0 

= A k (c\VLi +c 2 u 2 H- \~ c n u„) 

= C\A *Ul + C2A k VL2 + • • • + c n A k u n 
= ciX\ui + C2/I2U2 H-+ c n /\lu n , 


where the last step follows from the equalities A k u,- — l k \ij, which in turn follows from the definitions 
of eigenvalues and eigenvectors v4u z - — A fUf . 

Using the sum notation, we simply write it as 


n 



This formula is true for any nonnegative integer k. 


Let us next compute the dot product x £ • x ^. We have 





X£ • X£ = 




\ *=1 

n n 


J= 1 


= X!Z! c ' c / ; -^/ u ' ' u / 


Since A i is strictly larger than any other eigenvalues, we have 


lim 

k—>oo 



1 ifi = l 
0 if i = 2,3,... ,n. 


It follows that 


lim x k • x* = • ui. 

k —^oo 


Similarly, we compute 


-Xjt+1 = ( ^CiAfu, ) • ( ^CjX^Mj 


\ Z = 1 

« 72 


,7=1 


y]y]c ; c/2f2j +iu ; • u/ 


;=i /=! 


77 7Z / /I \ A: / n \ &+1 

*"SS"(fc) (fc) “ 


Thus we obtain 


lim X* • Xyt + i = A^ +1 c^ui • ui. 

k—> oo 


Combining these computations, we have 

lim^-Kx, x fc • x k+ 1 

lim Pk = -r:- 

k—>oo lim^oo x^ 

Af +1 C 2ui - Ui 
A^C^Ui • Ui 

= Al 


as required. This completes the proof. 




Eigenvectors and Eigenspaces 


Definition 


Let A be an n X n matrix. 

The eigenspace corresponding to an eigenvalue A of A is defined to be 
E\ = {x e C n | Ax = Ax}. 


Summary 


Let A be an n X n matrix. 

The eigenspace E\ consists of all eigenvectors corresponding to A and the zero vector. 
A is singular if and only if 0 is an eigenvalue of A. 

The nullity of A is the geometric multiplicity of A = 0 if A = 0 is an eigenvalue. 


Problems 


^ Let A be a 3 X 3 matrix. Suppose that A has eigenvalues 2 and —1, and suppose that u and v are eigenvectors 
corresponding to 2 and —1, respectively, where 


u — 

" 1 " 

0 

and v = 

"2" 

1 


-1 


0 


Then compute A 5 w, where w — 


7 ' 
2 

-3 


Solution. 

Since u is an eigenvector corresponding to the eigenvalue 2, we have 

Au = 2u. 

Similarly, we have 

Av = -v. 


From these, we have 

A 5 u = 2 5 u and Av = (—l) 5 v. 

To compute A 5 w, we first need to express w as a linear combination of u and v. Thus, we need to 
find scalars C\ , C 2 such that 






















W = CiU + C2 V. 


By inspection, we have 


‘ 7 ' 

2 

= 3 

‘ 1 ' 

0 

+ 2 

'2' 

1 

.-3. 


_-l_ 


.0. 


and thus we obtain C\ — 3 and C 2 = 2, 

We compute A 5 w as follows: 

A 5 w = A 5 (3u + 2v) 

= 3A 5 u + 2^4 5 v 
— 3 • 2 5 u + 2 • (— l) 5 v 
= 96u — 2v 



' 1 ' 


"2" 

96 

0 

- 2 

1 


-1 


0 


92 

-2 

-96 


Therefore, the result is 


A 5 w = 


92 

-2 


-96 


Let A — 


1 2 
-1 4 

2 -4 


. The matrix A has an eigenvalue 2. Find a basis of the eigenspace E 2 


corresponding to the eigenvalue 2. 
{The Ohio State University) 


Solution. 

By definition, the eigenspace E 2 corresponding to the eigenvalue 2 is the null space of the matrix A — 21 
That is, we have 

E 2 = AT (A - 21). 

We reduce the matrix A — 21 by elementary row operations as follows. 



















A-21 = 


R-2—Rl 

R3+2R 1 


-12 1 

-12 1 


2 


■4 

-2. 




-1 

2 

1" 

-Ri 

'1 

-2 

-1' 

0 

0 

0 

- > 

0 

0 

0 

0 

0 

0 


0 

0 

0 


Thus, the solutions x of (A — 21) x = 0 satisfy X\ — 2x2 + X 3 . 
Thus, the null space J\T(A — 21) consists of vectors 



2 x 2 + X 3 


"2" 


"1" 

X = 

1 

* * 

CO to 

= *2 

1 

.0. 

+ X3 

0 

_1_ 


for any scalars x 2 , x 3 . 


Hence we have 




'2' 


' 1 ' 

\ 

E 2 — M{A — 27) = Span 


1 

? 

0 




.0. 


.1. 

/ 



' 2 ' 


' 1 " 

It is straightforward to see that the vectors 

1 

? 

0 


0 


1 . 


of£V 


are linearly independent, hence they form a basis 


Thus, a basis of E 2 is 


2 

1 

0 


1 

0 

1 


Q Let 


"0 0 0 0 " 

, 1111 

A = 

0 0 0 0 

_1 1 1 1 _ 

One of the eigenvalues of the matrix A is A = 0. Find the geometric multiplicity of the eigenvalue A = 0. 


Solution (a). Eigenvalues of A and algebraic multiplies 

Eigenvalues and their algebraic multiplicities are determined by the characteristic polynomial p(t) of A . 
By definition, the characteristic polynomial of A is p(t) — det(v4 — tl) . 

We have 

































p(t ) = det(A — tl ) 

-t 0 0 0 

1 l -/ 1 1 1 

0 0-^0 
1 1 1 1 -t 


= 


1 - 1 1 1 
0 -t 0 
1 1 1 - 1 


= -tl -t 


1 -t 1 
1 1 -t 


= ' 2 ((l-o 2 -l) 

= t\t 2 - 2t ) 

= ? 3 (?- 2). 


by the first row cofactor expansion 


by the second row cofactor expansion 


Thus the characteristic polynomial is 

p(t) = t z (t-2). 

From this, the eigenvalues ofv4 are 0 and 2 with algebraic multiplicities 3 and 1, respectively. 

Solution (b). Geometric multiplicity 
We give two solutions for part (b). 

First Solution (b). (Finding the rank first) 

Recall that the geometric multiplicity of A is the dimension of the eigenspace Ex — Af (A — A/) . That 
is, the geometric multiplicity of X is the nullity of the matrix A — XI . 

Let us now consider the case X = 0 . 

We first find the rank of A — 0/ = A as follows. 


'0 

0 

0 

o' 


'0 

0 

0 

o' 


"l 

1 

1 

r 

1 

1 

1 

1 

^4— ^2 

1 

1 

1 

1 

R i ^R 2 

0 

0 

0 

0 

0 

0 

0 

0 

- > 

0 

0 

0 

0 

-* 

0 

0 

0 

0 

.1 

1 

1 

1. 


.0 

0 

0 

0. 


.0 

0 

0 

0. 


The last matrix is in reduced row echelon form. 
Hence the rank of A is 1. 

The rank-nullity theorem says that 


rank of A + nullity of A — 4. 

Thus, the nullity oiA — A — QI is 3, and hence the geometric multiplicity of X — 0 is 3. 

Second Solution (b). (Finding a basis of the eigenspace) 

In this solution, we find a basis of the eigenspace E o. 

By definition E o = Af{A — 0 1) — Af(A) . 
















Thus, the eigenspace E o is the null space of the matrix A . 
We solve the equation Ax = 0 as follows. 

The augmented matrix of this equation is 



1 

o 

0 

0 

0 

1 

o 


" 0 

0 

0 

0 

0 " 


" 1 

1 

1 

1 

1 0] = 

i 

1 

1 

1 

0 

R4-R2 

1 

1 

1 

1 

0 

R 1 4 -^R 2 

0 

0 

0 

0 






- » 






- » 





0 

0 

0 

0 

0 


0 

0 

0 

0 

0 


0 

0 

0 

0 


1 

1 

1 

1 

0 


0 

0 

0 

0 

0 


0 

0 

0 

0 


Hence the solution satisfies 


and the general solution is 


x = 


X\ = —X 2 - x 3 - X 4 


-1 

1 

X 

to 

1 

X 

CO 

1 

X 


'- 1 ' 


'- 1 " 


'-1 

X 2 

= *2 

1 

0 

+ -^3 

0 

+ X4 

0 

0 

*3 

1 

X 4 


. 0 . 


. 0 . 


. 1 . 


Therefore, the eigenspace is 

E o = Af(A) 


= < 


/ 




"- 1 ' 


■- 1 " 

— A 


1 


0 


0 

< x e C 4 

X = X 2 

0 

+ -^3 

1 

+ X4 

0 



. 0 . 


. 0 . 


. 1 . 


= Span 


/ 

'- 1 ' 


"- 1 " 



\ 


1 


0 


0 


1 

0 

5 

1 

? 

0 

1 


. 0 . 


O 

_ 1 


. 1 . 

J 


for any X 2 ,X 3 ,X 4 G C 


Thus the set 


r 





'-1 

\ 

1 

1 


0 


0 

> 

| 

0 


1 


0 


l 

. 0. 


1 

0 

1_ 


. 1. 

> 


is a spanning set of E o, and it is straightforward to check that the set is linearly independent. 
Hence this set is a basis of E o, and the dimension of E o is 3. 

The geometric multiplicity of A — 0 is the dimension of E o by definition. 

Thus, the geometric multiplicity of X is 3. 


Comment. 


These are Quiz 12 problems for Math 2568 (Introduction to Linear Algebra) at OSU in Spring 2017. 












































I could have combined these two problems into one problem, and just asked to find eigenvalues and 
algebraic/ geometric multiplicities for each eigenvalue. 

The reason I didn’t do is that I wanted to rescue students who didn’t get a correct answer in part (a) for 
some reasons. 

Also, since one of the eigenvalue is given in (b), students could use this information to double check their 
solutions in (a). 

(At least if you didn’t get the eigenvalue 0, you made a mistake somewhere.) 

Let A and B be n X n matrices. Suppose that these matrices have a common eigenvector x. Show that 

det(AB - BA) = 0. 


Steps. 

Q Write down eigenequations of A and B with the eigenvector x. 


Show that AB-BA is singular. 


A matrix is singular if and only if the determinant of the matrix is zero. 


Proof. 

Let a and fi be eigenvalues of A and B such that the vector x is a corresponding eigenvector. 
Namely we have Ax = ax and Bx = fix . 

Then we have 

(AB — BA)x — ABx — BAx = A(fix) — B(ax) 

= fiAx — aBx — fia — afi — 0. 

By the definition of eigenvector, x is a non-zero vector. Thus the matrix AB — BA is singular. 
Equivalently the determinant of AB — BA is zero. 

Comment. 

This is a simple necessary condition that A and B have a common eigenvector. 

Here are few derived questions. 

Q Is this a sufficient condition? 

If so prove it. 




If not give a counterexample, 







and find a necessary and sufficient condition. 


© 


Suppose that A is a diagonalizable matrix with characteristic polynomial 


f A ( A) = A 2 (A-3)(A + 2) 3 (A-4) 3 . 


(a) Find the size of the matrix A. 

(b) Find the dimension of £ 4 , the eigenspace corresponding to the eigenvalue A = 4. 

(c) Find the dimension of the nullspace of A. 

(Stanford University) 

Hint. 


If a matrix is diagonalizable, then the algebraic multiplicity of an eigenvalue is the same as the 
geometric multiplicity of the eigenvalue. 


Recall that the geometric multiplicity of an eigenvalue is the dimension of the eigenspace of the 
eigenvalue. 



Solution. 


If A is an n X n matrix, then its characteristic polynomial is of degree n. Since the degree of f a is 9 
, the size of A is 9 X 9. 

(b) Find the dimension of £ 4 , the eigenspace corresponding to the eigenvalue A = 4. 

The algebraic multiplicity of the eigenvalue A = 4 is 3. (This is the number of times that the factor 
A — 4 appears in .) 

Since A is diagonalizable, the algebraic multiplicity is the same as the geometric multiplicity for each 
eigenvalue. 

The geometric multiplicity is the dimension of the eigenspace by definition. Thus the dimension of £4 
is 3. 

(c) Find the dimension of the kemel(nullspace) of A. 

Note that ker (A) — {x € K 9 | Ax — 0} = {x £ R 9 | (A — 01g)x = 0} = £7o. Thus the 
kernel of A is the same as the eigenspace £0 corresponding to the eigenvalue A = 0. 

Since A is diagonalizable, the geometric multiplicity of A = 0, which is the dimension of the 
eigenspace £0 — ker (A), is the same as the algebraic multiplicity of A = 0. 

Thus the dimension of ker (A) is 2. 


© Let A be a square matrix and its characteristic polynomial is give by 


Find the rank of A. 

(The Ohio State University) 



Solution. 

Note that the degree of the characteristic polynomial p{t) is the size of the matrix A . 
Since the degree of p(t) is 3 + 2 + 4 + 1 = 10 , the size of the matrix A is 10 X 10. 

From the characteristic polynomial, we see that the eigenvalues of A are 1, 2, 3, 4 . 

In particular, 0 is not an eigenvalue of A . 

Hence the null space of A is zero dimensional, that is, the nullity of A is 0. 

By the rank-nullity theorem, we have 

rank of ^4 + nullity of A — 10. 


Hence the rank of A is 10. 


(a) Let 



«12 

«22 


be a matrix such that an + an = 1 and a 2 i + CL 22 — 1- Namely, the sum of the entries in each row is 1. (Such 
a matrix is called (right) stochastic matrix .) Then prove that the matrix A has an eigenvalue 1. 

(b) Find all the eigenvalues of the matrix 


B = 


0.3 0.7 
0.6 0.4 


(c) For each eigenvalue of B, find the corresponding eigenvectors. 



For (a), consider the vector x = 


For (b), use (a) and consider the trace of B and its relation to eigenvalues. 

For this relation, see the problem Determinant/trace and eigenvalues of a matrix. 




Solution. 

(a) Prove that the matrix A has an eigenvalue 1. 


Let x = 


1 

1 


and we compute 


A.x. — 

an a\2 


' 1 ' 

— 

Clll + d\2 

— 

1 ' 


_«21 «22 . 


.1. 


_^21 + CL 22 _ 


. 1 . 






















This shows that A has the eigenvalue 1. 

(b) Find all the eigenvalues of the matrix B . 

Note that the matrix B is of the type of the matrix in (a). 

Thus the matrix B has the eigenvalue 1. Since B is 2 by 2 matrix, it has two eigenvalues counting 
multiplicities. To find the other eigenvalue, we note that the trace is the sum of the eigenvalues. 

Thus we have 


tr (B) = 0.3 + 0.4= 1+A, 

where A is the second eigenvalue. Hence another eigenvalue is A — —0.3 . 


(c) For each eigenvalue of B , find the corresponding eigenvectors. 
By solving [B — /)x = 0 and ( B + 0.3/)x = 0 , we find that 


1 

1 


t 


and 



t 


for any nonzero scalar t are eigenvectors corresponding to eigenvalues 1 and —0.3, respectively. 


Comment. 

For some specific matrices, we can find eigenvalues without solving the characteristic polynomials like we 
did in part (b). 


G Let A be an n X n matrix. Suppose that Ai, A 2 are distinct eigenvalues of the matrix A and let Vi, V 2 be 
eigenvectors corresponding to Ai, A 2 , respectively. Show that the vectors Vi, V 2 are linearly independent. 


Proof. 

To show that the vectors v i, v 2 are linearly independent, consider a linear combination 

civi + c 2 v 2 = 0, (*) 

where C\ , C 2 are complex numbers. 

Our goal is to show that C\ — C 2 — 0 . 

By the definitions of eigenvalues and eigenvectors we have 

Av 1 =X\Y\ and^4v2 =^ 2 V 2 . 


Multiplying A and (*), we have 


0 = v4 • 0 

= A(c 1 v 1 + c 2 v 2 ) 

= ciAvi + C2AV2 

= C1A1V1 + c 2 ^ 2 v 2 . (**) 







Now we multiply A 2 and (*) and we obtain 

0 = c\A 2 v 1 + C 2 /I 2 V 2 . 

We subtract (**) from the last expression, and get 

0 = (C1A2V1 + C2/I2V2) - (ciAivi + C2/I2V2) 

= ci(A 2 -/li)vi. 

Recall that an eigenvector is by definition a nonzero vector, and hence v i ^ 0. 

Thus we must have 

Ci (A 2 ~ A i) = 0. 

Since A i and A 2 are distinct, we must have C\ — 0. 

Substituting C\ = 0 into (*), we also see that C 2 = 0 since V 2 7 ^ 0. 

Therefore, the values of C\ and C 2 are both zero, and hence the eigenvectors v 1 , v 2 are linearly 
independent. 

Related Question. 

As an application of this problem, try the next problem, which provides a geometrical meaning of 
eigenvalues. 


9 9 

Problem. Let T : R —> R be a linear transformation and let A be the matrix representation of T 
with respect to the standard basis of R 2 . 

Prove that the following two statements are equivalent. 

0 There are exactly two distinct lines L \, L 2 in R 2 passing through the origin that are mapped 
onto themselves: 

T(L 1 )=L 1 and T(L 2 )=L 2 . 

0 The matrix A has two distinct nonzero real eigenvalues. 


For a proof of this problem, see the post “A Linear Transformation Preserves Exactly Two Lines If and 
Only If There are Two Real Non-Zero Eigenvalues 44 . 

Let A and B be an n X n matrices. Suppose that all the eigenvalues of A are distinct and the matrices A and B 
commute, that is AB — BA. Then prove that each eigenvector of A is an eigenvector of B. 


Hint. 

Each eigenspace for A is one dimensional. 




Proof. 

Since A has n distinct eigenvalues, the characteristic polynomial for A factors into the product of degree 1 
polynomials. 

Thus, the algebraic multiplicity of each eigenvalue is, 1 and hence the geometric multiplicity is also 1. 

(The geometric multiplicity is always less than or equal to the algebraic multiplicity and greater than 0 by 
definition.) 

Thus the dimension of each eigenspace, which is the geometric multiplicity, is 1. 


Let A be an eigenvalue of the matrix A and let x be the eigenvector corresponding to A . 

Since the eigenspace Ex for A is one dimensional and xGf] is a nonzero vector in it, the vector x is a 
basis. 

That is, we have Ex — {tx | t £ C} . 


Now we multiply Ax = Ax by the matrix B on the left and obtain 

BAx — XBx 

ABx — XBx since AB — BA. 

This implies that5x ^ Ex — \p^ \ t ^ C} . Therefore there exists t £ C such that5x = tx . 

Hence the vector x is also an eigenvector corresponding to the eigenvalue t of the matrix B . 

This completes the proof. 


Let H and E be n X n matrices satisfying the relation HE — EH — 2 E. Let A be an eigenvalue of the matrix 
H such that the real part of A is the largest among the eigenvalues of H . Let x be an eigenvector corresponding to 
A. Then prove that Ex — 0. 

Proof. 


Using the given relation we have 


HEx — EHx — 2 Ex 
=> HEx — A Ex — 2 Ex 

=> HEx = (A + 2 )Ex. (*) 

Let v = Ex and assume that v^O. Then we have from (*) 

Hw — (A + 2)v for the nonzero vector v. 

This means that A + 2 is an eigenvalue of the matrix H and v is a corresponding eigenvector. 

However, the real part of the eigenvalue A + 2 is greater than that of A. This contradicts the choice of 
A. 

Therefore the vector v = Ex must be zero. 


Comment. 


You might wonder why we consider the relation HE — EH — 2E. 

In fact, this relation is a part of the relations of the Lie algebra si(2). 

Although you don’t have to know anything about Lie algebra to solve this problem, 

it might be good to know that these computations are actually used in a more advanced mathematic. 


^Let 



-1 

2 

5 


Pick your favorite number a. Find the dimension of the null space of the matrix A — al, where / is the 3 X 3 
identity matrix. Your score of this problem is equal to that dimension times five. 

(The Ohio State University) 


Solution. 

Note that if a is not an eigenvalue of the matrix A , then the matrix A — al is nonsingular, that is, the null 
space of A — al is the zero space, and the dimension is 0. 

Thus to obtain a nonzero point, your favorite number should be an eigenvalue. 

Let us find the eigenvalue. The characteristic polynomial p{t) of A is 

p(t) = det (A — tl ) 

5 -1 2 -1 

2 2 - t 2 

-1 2 5 -t 


(By the first row cofactor expansion.) 

= (5 - t)(t 2 - 7t + 6) - 2(-2 1 + 12) - (-1 + 6) 
= - 1 3 + 12 1 2 - 36 1 
= -t(t-6) 2 . 


2 2 
-1 5 - t 


+ (-l) 


2 

-1 


2 - t 
2 



Eigenvalues are roots of the characteristic polynomial. 

Thus eigenvalues of A are 0 and 6 , with algebraic multiplicities 1 and 2, respectively. 

Since the matrix A is a real symmetric matrix, it is diagonalizable. Thus the geometric multiplicity of each 
eigenvalue is equal to its algebraic multiplicity. 

Therefore the dimension of the null space of A — 61 , which is the geometric multiplicity of the 
eigenvalue 6 is equal to 2, and the dimension of the null space of A is 1. 


Thus, if you take a — 6, you get 2 X 5 = 10 points. If your favorite number is a = 0, then your score is 
1x5 = 5 points. The other choice of a will be zero points. 


Comment. 












If you didn’t notice that the given matrix is a real symmetric matrix, then you need to find eigenvectors 
corresponding to eigenvalues 6 and to show that the eigenspace has dimension 2. 




Let A — 


" 1 

-14 

4 " 


" 4 " 

-1 

6 

-2 

and v — 

-1 

_ —2 

24 

—7_ 


_ —7_ 


. Find A 10 v. You may use the following information 


without proving it. The eigenvalues of A are —1, 0,1. The eigenspaces are given by 


E_i — Span < 


3 

-1 

-5 


> , Eq — Span < 


-2 

1 

4 


> , Ei — Span < 


(The Ohio State University) 

Solution. 

Since A has three distinct eigenvalues, the eigenvectors 


-4 

2 

7 


> . 


. 3 . 


'-2 1 


-_4- 

-1 

? 

1 

? 

2 

-5 


4 


7 


form a basis of R 3 . 

Thus, we can express the vector v as a linear combination 



■ 3 - 


'-2 ' 


■-4' 

V = X 

-1 

-5 

+y 

1 

4 

+ z 

2 

7 


for some scalars x, y, z. 

We determine the values of x,y, z by Gauss-Jordan elimination. 
The augmented matrix of the system is 


3-2-4 4 

- 112-1 
-547-7 


Applying the elementary row operations, we obtain 
































1 3 

-2 

-4 

4" 


' 1 

-1 

-2 

1 1 

-1 

1 

2 

-1 


3 

-2 

-4 

4 

-5 

4 

7 

-7 

Then —R l 

-5 

4 

7 

-7 


r 2 -^r x 

■1 -1 -2 1- 

^1+^2 

R 2, +R 2 

■— 

<N 

O 

o 

■_ 

-► 

0 12 1 

-► 

0 12 1 


0 -1 -3 -2 

Then -R 2 

0 0 11 


ri o o 2 
oio-i 
0 0 1 1 


So we have 


and the linear combination is 


x = 2 ,y = - 1, z = 1, 


v — 2 


1 3 1 


'-2 1 


'-4 " 

-1 

- 

1 

+ 

2 

-5 


4 


7 


Recall that if! is an eigenvalue and u is a corresponding eigenvector of A, then we have 


T 0 u = l 10 u. 


Using this property, we compute 


A 10 v = A 10 * 


/ 

. 3 . 


'-2" 


- -4 1 

\ 

2 

-1 

- 

1 

+ 

2 


\ 

-5 


4 


7 

/ 



1 3 1 


'-2 ' 


-_ 4 - 

Z4 10 

■ 

l D 

1 1 

_■ 

O 

1 

1 

4 

+t° 

2 

7 



1 3 1 


'-2 " 


'-4 " 

2 (- l) 10 

-1 

-5 

-o 10 

1 

4 

+ (i) 10 

2 

7 



1 3 1 


'-4 ' 

2 

-1 

+ 

2 


-5 


7 


2 

0 

-3 


In summary, we obtain 











































A 10 v = 



m Let A, B,C are 2 X 2 diagonalizable matrices. The graphs of characteristic polynomials of A , B , C are shown 
below. The red graph is for A, the blue one for B, and the green one for C . From this information, determine the 
rank of the matrices A , B , and C . 



Hint. 

Observe that a null space a matrix M is the same as the eigenspace E o corresponding to the eigenvalue 
X — 0 (if 0 is an eigenvalue). 

So what you need to see is whether the graphs pass though the origin. 

Also, since the matrices are diagonalizable, the algebraic multiplicities are the same as the geometric 
multiplicities (the dimension of the eigenspace). 

You can determine the algebraic multiplicities from the graph by looking at whether the graphs are 
tangential to x -axis or not. 

Solution. 

We first determine the nullities of the matrices, and then using the rank-nullity theorem we obtain the 
ranks. 

In general the nullity of any matrix M is the dimension of the null space 

N{M) = {x e M 2 | Mx = 0} 

and the null space is the same as the eigenspace 

Eq = {x G K 2 | Mx = Ox = 0} 











corresponding to the eigenvalue >1 = 0 (if any). 

From this observation, we see that the nullity of a matrix M is the geometric multiplicity of the eigenspace 
E o associated to the eigenvalue 0 (if any). 

The Nullity of the Matrix A 
Now we determine the nullity of the matrix A . 

The graph of the characteristic polynomial Pa (A) of A passes through the origin (0, 0). 

Thus A — 0 is a root of p a (A) and hence A — 0 is an eigenvalue. 

Since the X-axis is tangential to the graph of Pa (A) , the algebraic multiplicity of A — 0 is 2. Since the 
matrix A is diagonalizable, the geometric multiplicity is the same as the algebraic multiplicity. Therefore 
the nullity of A is 2 . 

The Nullity of the Matrix B 

Next, the graph of characteristic polynomial ps (A) of B does not pass through the origin (0, 0) . Thus 
A — 0 is not an eigenvalue of B . This yields that the matrix B — B — 01 is nonsingular matrix, and 
hence we have the null space J\f {B) = {0} and the nullity of B is zero. 

The Nullity of the Matrix C 

Third, the origin (0, 0) is on the graphs of characteristic polynomial pc(A) of C, but the X-axis is not 
tangential to the graph of pc (A) • 

Therefore the algebraic (hence geometric) multiplicity of A — 0 is 1 . Thus the nullity of C is 1 . 

Ranks by the Rank-Nullity Theorem 
Finally, using the rank-nullity theorem 


rank + nullity = 2, 

we obtain the rank of A , 5, C are 0, 2,1, respectively. 

Let C be a 4 X 4 matrix with all eigenvalues A = 2,-1 and eigensapces 


E 2 — Span < 


1 

1 

1 

1 



/■ 

"1" 


"1" 

\ 

> and E_i = Span < 


2 

1 

5 

1 

1 

> 

> 

< 

_ 1 _ 


2 

> 


Calculate C 4 u for u = 


6 

8 

6 

9 


(The Ohio State University) 


if possible. Explain why if it is not possible! 


Solution. 


Let 














' 1 ' 


V 


'1" 

1 


2 


1 

1 

,V 2 = 

1 

V 3 = 

1 

.1. 


.1. 


.2. 


The vector v i is an eigenvector corresponding to eigenvalue A — 2 and the vectors V2, V2 are 
eigenvectors corresponding to A — — 1. 


If the vector u is a linear combination of v 1, V2 , V3 , then we can compute Cu and hence C 4 u. 
So let us first determine whether 


X1V1 + x 2 v 2 + X3V3 = u 


has a solution or not. 

We use Gauss-Jordan elimination method to find the solution. 
The augmented matrix for this system is 


1 

1 

1 

6 " 

1 

2 

1 

8 

1 

1 

1 

6 

1 

1 

2 

9 


By elementary row operations, this matrix reduces to 


1 

0 

0 

1 " 

0 

1 

0 

2 

0 

0 

1 

3 

O 

_1 

0 

0 

0 


Thus the solution is 

xi = 1 , x 2 = 2 ,x 3 = 3 

and we have the linear combination 

v 1 + 2 v 2 + 3 v 3 = u. 


Now we have 

C 4 u = C 4 vi + 2C 4 V2 + 3C 4 V3 

= 2 4 vi + 2 (— 1 ) 4 v 2 + 3 (— 1 ) 4 v 3 


' 1 ' 


" 1 " 


" 1 " 

1 

+ 2 

2 

+ 3 

1 

1 


1 


1 

.1. 


.1. 


.2. 


21 

23 
21 

24 






















Therefore we obtain 


c 4 


21 

23 
21 

24 


^ Let A be an n X n matrix and let c be a complex number. 

(a) For each eigenvalue A of A, prove that A + c is an eigenvalue of the matrix A + cl, where / is the identity 
matrix. What can you say about the eigenvectors corresponding to A + c? 

(b) Prove that the algebraic multiplicity of the eigenvalue A of A is the same as the algebraic multiplicity of the 
eigenvalue A + c of A + cl are equal. 

(c) How about geometric multiplicities? 

Proof. 

(a) A + c is an eigenvalue of A + cl. 

Let x be an eigenvector corresponding to the eigenvalue A. Then we have 


Ax — Ax. 


It follows that we have 


(A + cl)x — Ax + cx 
= Ax + cx 
= (A + c)x. 

Thus, we obtain 

(A + cl)x — (A + c)x, 


where x is a nonzero vector. 

Hence A + c is an eigenvalue of the matrix A + cl , and x is an eigenvector corresponding to 
A — c. 

In summary, if A is an eigenvalue of A and x is an associated eigenvector, then A + c is an 
eigenvalue of A + cl and x is an associated eigenvector corresponding to A + c. 

(b) Algebraic multiplicities are the same 

Let p(t) = det (A — tl ) be the characteristic polynomial of A. 

Let q{t) be the characteristic polynomial of the matrix A + cl. 

Then we have 

q(t) = det ((A + cl) — t) — det (A — (t — c )) 

= p(t - c ). 

Let Ai,. . ., Ajfe be distinct eigenvalues of A with algebraic multiplicities ni,. . ., n^, respectively. 
Then we have 


k 

p(i) = ±n(t-A i r. 

i =1 




It follows that we have 

q(t) = p{t - c ) 

k 

= ± 11 ^ - c ~ \) n ' 

i=l 

=±n(t-(A i +c))”‘. 

i =1 


From the last equation, we read that the eigenvalues of the matrix A + cl are \ + c with algebraic 
multiplicity rti for i — 1 ,..., k. 

Thus, geometric multiplicities of A and A — c are the same. 

(c) How about geometric multiplicities? 

From part (a), we know that eigenvectors of A are eigenvectors of A — c. 

Reversing the argument, the eigenvectors of A + c are eigenvectors of A. 

Thus, the eigenvectors correspond one to one, and the eigenspace of A is the same as the eigenspace 
of A + c. 

Hence their geometric multiplicities are the same. 

Applications 

As applications of this problem, consider the following problems. 



Problem. Find all the eigenvalues and eigenvectors of the matrix 

"3 9 9 9" 

4 9 3 9 9 

A = 

9 9 3 9 

9 9 9 3 


This is a problem of a Linear Algebra final exam at Harvard University. 

For a solution of this problem, see the post “Eigenvalues and eigenvectors of matrix whose diagonal 
entries are 3 and 9 elsewhere 46 . 


Problem. Find the determinant of the following matrix 


A 


6 2 2 2 2 
2 6 2 2 2 
2 2 6 2 2 
2 2 2 6 2 
2 2 2 2 6 






This is also a problem of a Linear Algebra final exam at Harvard University. 

See the post “Determinant of matrix whose diagonal entries are 6 and 2 elsewhere” for a solution. 


$ Find all the eigenvalues and eigenvectors of the matrix 

'3 9 9 9 

. 9 3 9 9 

A — 

9 9 3 9 

9 9 9 3 


{Harvard University ) 


Proof. 

(a) A + c is an eigenvalue of A + cl. 

Let x be an eigenvector corresponding to the eigenvalue A . Then we have 

Ax = Ax. 


It follows that we have 

(A + cl)x — Ax + cx 
— Ax + cx 
— (A + c)x. 

Thus, we obtain 


(A + cl)x = (A + c)x, 


where x is a nonzero vector. 

Hence A + c is an eigenvalue of the matrix A + cl , and x is an eigenvector corresponding to A — c . 

In summary, if A is an eigenvalue of A and x is an associated eigenvector, then A + c is an eigenvalue of 
A + cl and x is an associated eigenvector corresponding to A + c . 

(b) Algebraic multiplicities are the same 

Let p{t) — det(v4 — tl ) be the characteristic polynomial of A . 

Let q{t) be the characteristic polynomial of the matrix A + cl . 

Then we have 

q{t) = det ((A + cl) — t) — det (A — (t — c) ) 

= p{t-c). 

Let A i,..., A k be distinct eigenvalues of A with algebraic multiplicities n \,..., n k , respectively. 

Then we have 








It follows that we have 

q(t) =p{t-c ) 

= ~ c - Xj) n ‘ 

i =1 

=±n(/-a,+c))"'. 

i=l 


From the last equation, we read that the eigenvalues of the matrix A + cl are A / + c with algebraic 
multiplicity n / for i = 1 ,..., k . 

Thus, geometric multiplicities of A and A — c are the same. 

(c) How about geometric multiplicities? 

From part (a), we know that eigenvectors of A are eigenvectors of A — c . 

Reversing the argument, the eigenvectors of A + c are eigenvectors of A . 

Thus, the eigenvectors correspond one to one, and the eigenspace of A is the same as the eigenspace of 

A c. 

Hence their geometric multiplicities are the same. 

Applications 

As applications of this problem, consider the following problems. 


Problem. Find all the eigenvalues and eigenvectors of the matrix 


3 

9 

9 

9 

9 

3 

9 

9 

9 

9 

3 

9 

9 

9 

9 

3 


This is a problem of a Linear Algebra final exam at Harvard University. 

For a solution of this problem, see the post “Eigenvalues and eigenvectors of matrix whose diagonal entries 
are 3 and 9 elsewhere". 


Problem. Find the determinant of the following matrix 


A 


6 2 2 2 2 
2 6 2 2 2 
2 2 6 2 2 
2 2 2 6 2 
2 2 2 2 6 






This is also a problem of a Linear Algebra final exam at Harvard University. 

See the post “Determinant of matrix whose diagonal entries are 6 and 2 elsewhere” for a solution. 


€> Find the determinant of the following matrix 


A = 


6 

2 

2 

2 

2 


2 2 
6 2 
2 6 
2 2 
2 2 


2 2 
2 2 
2 2 
6 2 
2 6 


(Harvard University) 


Hint. 

Computing the determinant directly by hand is tedious. 

So use the fact that the determinant of a matrix A is the product of all eigenvalues of A . 


However, finding the eigenvalue of A itself is as complicated as computing the determinant of A . 
Instead, first determine the eigenvalues of B — A — 4J . 

Then use the fact that if A is an eigenvalue of B , then A + 4 is an eigenvalue of A . 


For a proof of this fact, see the post “Eigenvalues and algebraic/geometric multiplicities of matrix 

A + cI “. 


Solution. 


Let B — A — 4J , where / is the 5 X 5 identity matrix. 
Then every entry of B is 2 . 


By elementary row operations, we can reduced the matrix B into 


'2 

2 

2 

2 

to 

_1 


'1 

1 

1 

1 

r 

2 

2 

2 

2 

2 


0 

0 

0 

0 

0 

2 

2 

2 

2 

2 

-A 

0 

0 

0 

0 

0 

2 

2 

2 

2 

2 


0 

0 

0 

0 

0 

.2 

2 

2 

2 

2. 


.0 

0 

0 

0 

0. 


Thus, the rank of B is 1 , hence the nullity is 4 by the rank-nullity theorem. 
It follows that 0 is an eigenvalue of B and its geometric multiplicity is 4. 


Since all entries of B are equal, we compute 









"r 


V 

1 


1 

1 

= 10 

1 

1 


1 

. 1 . 


. 1 . 


This yields that 10 is an eigenvalue of B and the vector [11 111 ] T is an eigenvector corresponding to 


10 . 


Combining these observations, we see that the matrix B has eigenvalues 0 and 10 with (algebraic) 
multiplicities 4 and 0, respectively. 

Since A = B + 4/ , the eigenvalues of A are 4 and 14 with algebraic multiplicities 4 and 0, 
respectively. 

(See Problem Eigenvalues and algebraic/geometric multiplicities of matrix A + cl .) 


The determinant of A is the product of all the eigenvalues of A (counting multiplicities). Thus we have 

det(^) = 4 4 • 14 = 3584. 


© Find all eigenvalues of the matrix 


A = 


0 iii 
i 0 i i 
i i 0 i 
iii 0 


where i — For each eigenvalue of A, determine its algebraic multiplicity and geometric multiplicity. 


Proof. 

We use an indirect method to find the eigenvalues of A as follows. 

Let B = A + il . Then note that if X is an eigenvalue of B , then X — i is an eigenvalue of A . 

Furthermore, the algebraic (respectively, geometric) multiplicity of X is the same as the algebraic 
(respectively, geometric) multiplicity of X — i . 


We reduced the matrix B as follows: 


i 

i 

i 

i 

R 2 R i 

i 

i 

i 

i 


'1 

1 

1 

ll 

i 

i 

i 

i 

R 3 R i 

R 4 R i 

0 

0 

0 

0 

f*l 

0 

0 

0 

o 

i 

i 

i 

i 

-» 

0 

0 

0 

0 

-> 

0 

0 

0 

0 

_i 

i 

i 

i _ 


.0 

0 

0 

0. 


.0 

0 

0 

0. 


It follows that the rank of B is 1, and thus the nullity of B is 3 by the rank-nullity theorem. 
















Since the nullity is greater than 0, it yields that 1 — 0 is an eigenvalue of A . 


Recall that the eigenspace E o of X — 0 is the null space of B . 

Hence the geometric multiplicity of A — 0 is the nullity of B , thus it is 3. 
It follows that the algebraic multiplicity of A — 0 is either 3 or 4. 


Recall that the sum of all eigenvalues is equal to the trace of B , which is tr(i?) = 4 i . 

Thus, we see that A — 0 is not the only eigenvalue. 

There can be only one more eigenvalue of B since the sum of algebraic multiplicities of all eigenvalues of 
B is 4. 

It follows that 4 i is an eigenvalue of B with algebraic multiplicity 1, and hence the geometric multiplicity 
is also 1. 

It yields tha the algebraic multiplicity of X — 0 must be 3. 


Another way to see that 4 i is an eigenvalue of B is to notice that the vector 

" 1 " 

1 

1 

1 


is an eigenvector. In fact, we have 



(Since all the entries are the same, it might not so difficult to notice this.) 


In summary so far, we have obtained that eigenvalues of B are 0 and 4 i with the algebraic/geometric 
multiplicities are 3 and 1, respectively. 

Since A = B — il , it follows that the eigenvalues of A are 0 — i — —i and 4 i — i = 3/ with the 
algebraic/geometric multiplicities are 3 and 1, respectively. 

Comment. 

This is the second problem of Quiz 13 (Take Home Quiz) for Math 2568 (Introduction to Linear Algebra) 
at OSU in Spring 2017. 

The first problem of Quiz 13 is “Quiz 13 (Part 1) Diagonalize a matrix. 46 . 

Find all the eigenvalues and eigenvectors of the matrix 








10001 

3 

5 

7 

9 

11 

1 

10003 

5 

7 

9 

11 

1 

3 

10005 

7 

9 

11 

1 

3 

5 

10007 

9 

11 

1 

3 

5 

7 

10009 

11 

1 

3 

5 

7 

9 

10011 


(MIT) 


Solution. 


Let 


B = A - 100007, 


where 7 is the 6 X 6 identity matrix. That is, we have 


1 3 5 7 9 
1 3 5 7 9 
1 3 5 7 9 
1 3 5 7 9 
1 3 5 7 9 
1 3 5 7 9 


11 

11 

11 

11 

11 

11 


Since all the rows are the same, the matrix B is singular and hence 1 — 0 is an eigenvalue of B. 

Let us determine the geometric multiplicity of 1 = 0 (namely, the dimension of the null space of B ). 


We apply elementary row operations to B and obtain 


elementary row operations 

B ---► 


13 5 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


7 9 
0 0 
0 0 
0 0 
0 0 
0 0 


11 

0 

0 

0 

0 

0 


Thus, if 7?x = 0 , then we have 

Xi = —3x2 — 5X3 — 7X4 — 9X5 — 11X6- 
It follows from this that basis vectors of the eigenspace Eq — M{B) are 


-3 


-5 


-7 


-9 


-11 

1 


0 


0 


0 


0 

0 


1 


0 


0 


0 

0 

5 

0 

? 

1 

5 

0 

5 

0 

0 


0 


0 


1 


0 

0 


0 


0 


0 


1 


and hence the geometric multiplicity corresponding to X = 0 is 5. 
By inspection, we see that 



















where 


By — 36v, 


1 

1 

1 

v = 

1 

1 

_ 1 _ 

Thus, it yields that A — 36 is an eigenvalue of B and v is a corresponding eigenvector. 


Recall that the algebraic multiplicity of an eigenvalue is greater than or equal to the geometric multiplicity. 

Also the sum of algebraic multiplicities of all eigenvalues of B is equal to 6 since B is a 6 X 6 matrix. 

It follows from this observation that we determine that the algebraic multiplicity of A — 0 is 5 and the 
algebraic and geometric multiplicities of A — 36 are both 1. 

Hence the vector v form a basis of the eigenspace E 36 . 


Now that we have determined eigenvalues and eigenvectors of B , we can deduce those of A as follows. 

In general, if A — B + cl , then the eigenvalues of A are A + c , where A are eigenvalues of B . 

The eigenvectors for A corresponding to A + c are exactly the eigenvectors for B corresponding A . 
(See the post “Eigenvalues and algebraic/geometric multiplicities of matrix^ + cl ” for a proof.) 


In the current problem, we have A — B + 10000/ , and thus c — 10000. 
Therefore, the eigenvalues of A are 10000,10036 . 

Eigenvectors corresponding to 10000 are 


'-3' 


■-5' 


'-7' 


'-9' 


■-11' 

1 


0 


0 


0 


0 

0 


1 


0 


0 


0 


+ X3 


+ X4 


+ X5 


+ X6 


0 


0 


1 


0 


0 

0 


0 


0 


1 


0 

. 0 . 


. 0 . 


. 0 . 


. 0 . 


1 


where (x 2 ,X 3 ,X 4 ,X 5 ,X 6 ) / (0,0,0,0,0,0) . 


The eigenvector corresponding to 10036 is 

"1 

1 

1 

a 

1 

1 

1 
















where a is any nonzero scalar. 


Consider the matrix 



2 

-3/2. 


E M2x2 • 


(a) Find the eigenvalues and corresponding eigenvectors of A. 

E M 2 , we can choose n large enough so that the length || A n v|| is as small as we 

like. 

(University of California, Berkeley) 


(b) Show that for v = 


Proof. 

(a) Find the eigenvalues and corresponding eigenvectors of A . 

To find the eigenvalues of A , we compute the characteristic polynomial p{t) of A . 
We have 


p{t) — det(^ — tl) 

3/2- t 2 
-1 -3/2 - t 

= t 2 - 1/4. 


Since the eigenvalues are roots of the characteristic polynomials, the eigenvalues of A are ±1/2. 

Next we find the eigenvectors corresponding to eigenvalue 1/2. 

These are the solutions of (A — ^/)x = 0 . 

We have 


' 1 

2 ' 


'1 

2 ' 

.-1 

- 2 . 

-> 

.0 

0 . 


Thus, the solution x satisfies x i = — 2x2, and the eigenvectors are 


x = x 2 


where x 2 is a nonzero scalar. 


Similarly, we find the eigenvectors corresponding to the eigenvalue —1/2. 
We solve (A + \l)x = 0 . 

We have 


A + 



' 2 

2 ' 


'1 

1' 



-> 



.-1 

-1. 

then R 2 ~^~R 1 

.0 

0. 


Thus, we have x i = —x 2 , and the eigenvectors are 





















where x 2 is a nonzero scalar. 


(b) We can choose n large enough so that the length | A " v is as small as we like. 


We express the vector v = 


as a linear combination of eigenvectors 


and 


corresponding to eigenvalues 1/2 and —1/2, respectively. 
Let 


= c 1 


-2 

1 


C 2 


-1 

1 


for some scalars Ci, C 2 ■ 

Solving this for Ci, C 2 , we find that C \ — — 1 and C 2 = 1 , and thus we have 


1 

0 . 

Then for any positive integer n, we have 


-2 

1 


A n 


= -A n 


-2 

1 


+ A n 


-1 

1 


-1 

1 


i y 

'- 2 ' 

( 1 Y 

' -1 

- 


+ — 


2 J 

. 1 . 

V *) 

_ 1 _ 


1 \ n \ 2 - (- 1 )" 


2 ) L-i + (-i)"J 


Note that in the second equality we used the following fact: If Ax — lx , then A n x — 1 n x . 


Then the length is 


A n 


< 


(2 - (-1)” ) 2 + (-l + (-l)”) 2 

v/F+7 


v/13 ( — ) 0 as n tends to infinity. 


Therefore, we can choose n large enough so that the length \\A n v|| is as small as we wish. 


Let F and H be an n X n matrices satisfying the relation HF — FH — —2 F. 

(a) Find the trace of the matrix F. 

(b) Let A be an eigenvalue of H and let v be an eigenvector corresponding to A. Show that there exists an 
positive integer N such that F N v = 0. 

Hint. 


For (a), take the trace of the both sides of the given relation. 










































0 For (b), show that if F k v 7 ^ 0 then there are infinitely many eigenvalues, hence a 
contradiction. 


Proof. 

(a) The trace of the matrix F 

Using the given relation we compute the trace of F as follows. 

By taking the trace of both sides we have 

tr(-2F) = tr (HF - FH). 

The right hand side is — 2tr(i^) and the left hand side is 

tr (HF - FH) = tr (HF) - tr (FH) 

= tr {HF) - tr {HF) = 0. 

Therefore we have tr(i^) = 0. 

(b) There exists an positive integer N such that = 0. 

Since v is an eigenvector corresponding to the eigenvalue A of H, we have Hw — Av or 
equivalently (H — A/)v = 0. 

Now we compute 

F(H - A I) — FH — XF 
= (HF + 2 F) — XF = (H — (X — 2 )I)F. 

Therefore we have 

F(H - A I) = (H - (A - 2 )I)F. 

Evaluating at v, we obtain 

0 = F(H - A/)v = (H - (A - 2)I)Fv. 

If Fw 0, then this equality implies that Fw is an eigenvector corresponding to the eigenvalue 
A — 2 of H. In this case we further calculate 

0 = F(H - (A - 2 )I)Fw 

= (FH - (A - 2 )F)F = (HF + 2F - (A - 2 )F)Fw 
= (H-(A- 4))F 2 v. 

If the vector F 2 v ^ 0, then this equality implies that F 2 w is an eigenvector corresponding to the 
eigenvalue A — 4 of H. 

Repeating this procedure, we see that 

0 = {H- (A - 2 k))F k v 



for all k. 

Therefore, if F k v is nonzero vector for all k , then there are infinitely many eigenvalues A — 2k but 
this is impossible since H is an n X n matrix and hence H has at most n eigenvalues. Therefore 
there exists N such that F N v = 0. 




Let A and B be n X n matrices and assume that they commute: AB — BA. Then prove that the matrices A and 
B share at least one common eigenvector. 


Proof. 

Let A be an eigenvalue of A and let x be an eigenvector corresponding to A . That is, we have Ax = Ax . 
Then we claim that the vector v := Bx belongs to the eigenspace Ex of A. 

In fact, as AB = BA we have 


Av = ABx = BAx — ABx = Ay. 


Hence v E Ex . 


Now, let {xi,..., x^} be an eigenbasis of the eigenspace Ex . 

Set V/ = Bxi for i — 1,..., k . 

The above claim yields that V/ £ Ex , and hence we can write 

V i = Ci,-Xi + C 2 ,-X 2 H-b CfcX* (*) 

for some scalars C \\, c 2 /,..., Cfo . 


Extend the basis {xi,..., x^} of to a basis 

{X 1, . . . , X^, X^i r • *)X/i} 

of R” by adjoining vectors x^ + i,..., x n . 


Then we obtain using (*) 


5[xi,...,x / t,Xyt+i,...,x„] 

= [5xi,..., Bx k , Bx k + 1 ,..., Bx n ] 
= [v 1 , . . . , V*: , BXk +1 ,..., 5x„ ] 


— [xi,...,Xyfc,X A+1 ,...,X„] 


c 

5) 

0 

F 









where C — (c z y ) is the k X k matrix whose entries are the coefficients Cy of the linear combination (*), 
O is the (n — k) X k zero matrix, D is a k X (n — k) matrix, and F is an (n — k) X (n — k) 
matrix. 


LetP = [xi,...,x*,x* + i,...,x„] • 

As the column vectors of P are linearly independent, P is invertible. 
From (**), we obtain 


It follows that 


det(5 - tl) = det(P ~ l BP - tl) = 


1 


’ C 

D 

L BP = 





o 

F 


C- 

tl 

D 


O 


F -tl 


= det(C — tl) detf/ 7 — tl). 


Let // be an eigenvalue of the matrix C and let a be an eigenvector corresponding to /i . 
Then as det(C — jul) — 0 , we see that det(5 — jul) — 0 and // is an eigenvalue of B . 

Write 


and define a new vector by 


a\ 


a = 


a 2 


^0 




y = aixi H - h a k yi k e E x . 

Then y is an eigenvector in Ex since it is a nonzero (as a 0 ) linear combination of the basis Ex . 


C 

D 

O 

F 


Multiplying BP — P 


from (**) by the n -dimensional vector 





















c 

D 

0 

F 


a 

0 

0 


The left hand side is equal to 

B[a x xi H- \-a k x k ] = By. 

On the other hand, the right hand side is equal to 


C a 
0 


= P 


/Ll& 

0 


= juP 


0 


= ju[a 1 x 1 H-h a k x k ] = juy. 


Therefore, we obtain 


By = juy. 

This proves that the vector y is eigenvector of both A and B . 
Hence, this completes the proof 


Let a and b be two distinct positive real numbers. Define matrices 

0 b 

b 0. * 

Find all the pairs (A, X), where A is a real number and X is a non-zero real matrix satisfying the relation 

AX + XB = XX. 


A 


0 a 
a 0 


, B:= 


(The University of Tokyo) 



O 

0 

0 


Let X = 


xi 

_^3 


X2 

X4 


and compute (*). 


Compare entries of matrices. 


Rewrite it as a matrix equation. 


The problem is now translated to a problem of eigenvalue/eigenvectors. 


Solution. 




























Let X = 


xi 

_^3 


X 2 

X4 


Then the relation (*) becomes 


~x 3 

£4 

+ 6 

X2 


= A 

Xi 

%2 

. X 1 

x 2 _ 


£4 

^3_ 


_^3 

X4_ 


Comparing the entries of matrices, we obtain four equations 

ax 3 + bx 2 — Xx 1 
ax 4 + bx 1 = A#2 
a#! + 6x4 = Ax 3 
a#2 + bx 3 = Aa?4. 


We rewrite these equations into the matrix equation 


—A b a 0 


Xi 


'o' 

b —A 0 a 


X 2 


0 

a 0 —A b 


X3 


0 

0 a b —A. 


-X 4 _ 


.0. 


Thus the problem is to find all the pairs (A, X) satisfying this matrix equation. 

b a 0 

0 0a 

0 0 6 ' 

a 6 0_ 

Then the problem is equivalent to find all eigenvalues and eigenvectors of the matrix C. 
So we first compute the characteristic polynomial of C to find eigenvalues. 

We have 


Let 


c = 


det (C - A I) 


-A 

b 


a 

0 

b 

-A 


0 

a 

a 

0 


-A 

b 

0 

a 


6 

-A 

A 4 - 

2 (a 2 

+ 

cr 

to 

to 

+ 1 


b 2 ) 2 . 


(Use cofactor expansion and simplify to obtain this.) 

We solve 

A 4 - 2 (a 2 + b 2 ) A 2 + (a 2 - 6 2 ) 2 = 0 

for A. 

By the quadratic formula we have 

A 2 = a 2 +b 2 ± (a 2 + 6 2 ) 2 - (a 2 - 6 2 ) 2 

= a 2 + 6 2 i 2 ab — (a =b 6) 2 . 





















Hence we obtained four eigenvalues A = =b(a =b b). 

Note that since we have four distinct eigenvalues, each eigenspace is one dimensional. 


Now, let us find eigenvectors. First consider the eigenvalue A — a + b. 
In this case, 


c 


A I = 


-a — b b a 0 

b —a — b 0 a 

a 0 —a — b b 

0 a b —a — b 


The eigenvector is the solutions of (C — A/)x = 0 . The typical method to find eigenvector is to 
reduce the matrix C — XI into row echelon form. 

But as we noted earlier each eigenspace is one dimensional, so if we find one nonzero vector solution, 
then it is a basis of the eigenspace. 


For this specific matrix, we see that x = 


satisfies (C — AI)x = 0 . Therefore all the 


eigenvectors corresponding to eigenvalue A = a + b is 

1 
1 

x = 

1 
1 


for any nonzero scalar t . 


Similarly, we find that 


'-1 


' 1 " 


■ i" 

1 

t, 

-1 

£, 

i 

1 


1 


-i 

.-1. 


.-1. 


.-i. 


for any nonzero t are all the eigenvector corresponding to eigenvalue A = — a — 6 , a — 6, — a + 6, 
respectively. 

Therefore the solution to the problem is that the pair (A, X) is one of the following paris. 


a T - b) t 
a — b,t 


"1 1 
_1 1 
1 -1 
1 -1 


a — 6, t 

J ( — CL + 6, t 


-1 1 
1 -1 

1 1 

-1 -1 


for nonzero t (because X must be a nonzero matrix.) 























Diagonalization of Matrices 


The general procedure of the diagonalization is explained in the post “How to Diagonalize a Matrix. Step by Step 
Explanation* 4 . 


Definition 


Let A, B be n X n matrices. 

A and B are similar if there exists a nonsingular matrix P such that P _1 AP = B . 

^ A is diagonalizable if there exist a diagonal matrix D and nonsingular matrix P such that 
P _1 AP = D. (Namely, if A is diagonalizable if it is similar to a diagonal matrix.) 

A is said to be defective if there is an eigenvalue A of A such that the geometric multiplicity of A is 
less than the algebraic multiplicity of A. 


Summary 


Let A, B be n X n matrices. 

If A and B are similar, then the characteristic polynomials of A and B are the same. Hence the 
eigenvalues of A, B and their algebraic multiplicities are the same. 

A is diagonalizable if and only if A is not defective. 

A is diagonalizable if and only if M n has an eigenbasis of A (a basis consisting of eigenvectors). 

A is diagonalizable if and only if there are n linearly independent eigenvectors of A. 

^ If A has n distinct eigenvalues, then A is diagonalizable. 

If Vi,.. ., v n are linearly independent eigenvectors of A corresponding to the eigenvalues 
Ai,.. ., A n (not necessarily distinct), then S~ l AS = D , where S = [vi ,..., v n ] and 
D = diag(Ai,..A n ). 


Problems 

Let A, B, and C be n X n matrices and 7 be the n X n identity matrix. Prove the following statements. 

(a) If A is similar to B, then B is similar to A. 

(b) A is similar to itself. 

(c) If A is similar to B and B is similar to C, then A is similar to C. 

(d) If A is similar to the identity matrix /, then A — I. 

(e) If A or B is nonsingular, then AB is similar to BA. 

(f) If A is similar to B, then A k is similar to B k for any positive integer k. 


Definition. 

















We say that a matrix A is similar to a matrix B is there exists a nonsingular (invertible) matrix P such 
that 


B = P~ l AP. 


Proof. 

(a) If A is similar to B , then B is similar to A . 

If A is similar to B , then there exists a nonsingular matrix P such that B — P ~ l AP . 
Let Q — P 1 . Since P is nonsingular, so is Q. 

Then we have 


Q l BQ = {P~ l )~ l BP~ l = PBP ~ l 
= P(P _1 = IAI = > 4 . 


Hence 5 is similar to v4 . 

(b) ^4 is similar to iteself. 

Since the identity matrix / is nonsingular and we have 

A=I~ 1 AI, 

the matrix A is similar to A itself. 

(c) If A is similar to B and B is similar to C, then A is similar to C. 
If A is similar to B , we have 

B = P~ 1 AP , 


for some nonsingular matrix Z 5 . 

Also, if 5 is similar to C, we have 

c = e 

for some nonsingular matrix Q. 

Then we have 

c = Q X BQ 
= Q~ 1 (P~ 1 AP)Q 
= (PQ)~ l A(PQ). 


Let R = PQ . Since both/ 5 and 2 are nonsingular, R — PQ is also nonsingular. 
The above computation yields that we have 

C = R _1 AR , 


hence v4 is similar to C. 


(d) If A is similar to the identity matrix I , then A = I. 

Since A is similar to /, there exists a nonsingular matrix P such that 

A = P ~ l IP. 

Since P ~ l IP = / , we have A = / . 

(e) If/I or 5 is nonsingular, then // 5 is similar to BA . 

Suppose first thatv4 is nonsingular. Thenv4 is invertible, hence the inverse matrix A -1 exists. 
Then we have 

A ~ 1 (AB)A = A ~ l ABA = IB A = BA, 


hence AB and BA are similar. 

Analogously, if B is nonsingular, then the inverse matrix B~ l exists. 
We have 


B~ l (BA)B = B k BAB = IAB = AB, 


hence AB and BA are similar. 

(f) If A is similar to B, then A k is similar to B k . 

If A is similar to B , then we have 

B = P ~ l AP 


for some nonsingular matrix P . 

Then we have for a positive integer k 

B k = (P~ l AP) k 

= (P ~ 1 AP)(P ~ l AP ) • • • (P -^P) 

V -V- 

k times 

= p -i A k P 


since we can cancel P and P 1 in between. 

Hence A k and B k are similar. 

Comment. 

Part (a), (b), (c) show that similarity is an equivalence relation. 


Prove that if A and B are similar matrices, then their determinants are the same. 




Proof. 

Suppose that A and B are similar. Then there exists a nonsingular matrix S such that 

S ~ l AS = B 

by definition. 

Then we have 

det( 5 ) 

= det(S _1 AS) 

= det(S)- 1 det(, 4 )det( 1 S) 

(by multiplicative properties of determinants) 

= det(d) 

(since determinants are just numbers, hence commutative). 

More generally, we can prove that if A and B are similar, then their characteristic polynomials are the 
same. 

From this, we also can deduce that the determinants of A and B are the same as well as their traces are the 
same. 


A 

(a) Is the matrix A — 

(b) Is the matrix A — 

(c) Is the matrix A — 

(d) Is the matrix A — 


1 2 

0 3 
0 1 

5 3 

-1 

-2 

-1 

-2 


similar to the matrix B — 

'3 O' 


.1 2. 

similar to the matrix B — 

'1 2" 


.4 3. 


6 

6 

6 

6 


similar to the matrix B = 
similar to the matrix B — 


0 2 
1 2 

-1 4 


? 


□ 


Determine whether the matrix A — 

matrix D such that S~ l AS — D. 
(The Ohio State University) 


1 4 

2 3 


is diagonalizable. If so, find a nonsingular matrix S and a diagonal 


Solution. 

To determine whether the matrix A is diagonalizable, we first find eigenvalues of A . 
To do so, we compute the characteristic polynomial p{t) of A: 


pit) = 


i - 1 

2 


4 

3- 


( 1 - 0 ( 3-0 -8 


= t 2 — At — 5 = (t + 1 ){t — 5). 


The roots of the characteristic polynomial p{t) are eigenvalues ofv4 . 
Hence the eigenvalues of A are — 1 and 5. 


Since the 2 X 2 matrix A has two distinct eigenvalues, it is diagonalizable. 















To find the invertible matrix S , we need eigenvectors. 

Let us find the eigenvectors corresponding to the eigenvalue — 1. 
By elementary row operations, we have 


A - (-1 )I = A+I = 


Ri—R-i 


2 4 
0 0 


7*1 


2 4 
.2 4 

1 2 
0 0 


It follows that the eigenvectors corresponding to — 1 are of the form 


-2 

1 


for any nonzero scalar a. 


Similarly, we have 


A-51 = 


r-4 

4 1 

f* 1 

n 

-ii 

R 2 -2R 1 

n 

-ii 

i 

to 

1 

to 

i_ 

-> 

i 

to 

1 

to 

1_ 

-> 

i 

o 

-1 

o 


Hence the eigenvectors corresponding to 5 are of the form 

b 


for any nonzero scalar b. 


Thus, u 


2 

1 


and v 


arc basis vectors of eigenspace E-\,E 5 , respectively. 


Define 


S := [u v] 


-2 1 

1 1 


Then by the general procedure of the diagonalization, we have 

S~ 1 AS = D, 


where 


D := 



0 

5 


A 

Diagonalize the 2x2 matrix A — 
such that S~ X AS — D. 


-1 

2 


by finding a nonsingular matrix S and a diagonal matrix D 
































Solution. 

The characteristic polynomial p{t) of the matrix A is 

p{t) = det(^ - tl) = 2 ^ 2 

= (2 - t ) 2 - 1 = t 2 - 4t + 3 
= (/-l)(/-3). 

It follows that the eigenvalues ofv4 are X — 1,3 with algebraic multiplicities are both 1. 

Hence, the geometric multiplicities are 1 and thus any nonzero vector in eahc eigenspace forms a 
eigenbasis. 


Now let us find a eigenbasis for each eigenspace E % — J\f (A — XI) . 
For the eigenvalue 1, we have 


A — I — 


1 -1 

-1 1 


R 2 ~\~R i 
-» 


1 -1 

0 0 


This yields that the eigenvectors corresponding to the eigenvalue 1 are x' 


with x 2 ^ 0. Hence 


vi 


e Ei 


is an eigenbasis for E \ 


Similarly, as we have 

A - 3/ = 

we see that 


-1 

-1' 

-*1 

' 1 

1 ' 

R 2 ~^~R 1 

'1 

1' 

.-1 

-1. 

- > 

.-1 

-1. 

- > 

.0 

0. 


V 2 = 


-1 

1 


e E 3 


is an eigenbasis for E 3 


Let 


S = 


vi v 2 J 


1 -1 

1 1 


and D = 


1 0 
0 3 


Then the diagonalization procedure yields that S is nonsingular and S 1 AS = D . 




























Diagonalize the matrix 


A = 


4 

3 

-1 


-3 - 3 " 

-2 -3 

1 2 


by finding a nonsingular matrix S and a diagonal matrix D such that S 1 ^45 = D. 


Diagonalization Procedure 

Letv4 be the n X n matrix that you want to diagonalize (if possible). 

Find the characteristic polynomial p(t) of A . 

Find eigenvalues A of the matrix A and their algebraic multiplicities from the characteristic 
polynomial p{t) . 


For each eigenvalue A of A , find a basis of the eigenspace Ex . 

If there is an eigenvalue A such that the geometric multiplicity of A , dim^^ ), is less than the 
algebraic multiplicity of A , then the matrix A is not diagonalizable. If not, A is diagonalizable, 
and proceed to the next step. 


If we combine all basis vectors for all eigenspaces, we obtained n linearly independent 
eigenvectors v i, V2,..., v n • 


0 Define the nonsingular matrix 


s = [viv 2 .. .V w ]. 


0 Define the diagonal matrix D, whose (/, i) -entry is the eigenvalue A such that the /-th column 
vector Vi is in the eigenspace Ex . 


0 Then the matrix A is diagonalized as 

S~ l AS = D. 


Example of a matrix diagonalization 

Now let us examine these steps with an example. 
Let us consider the following 3x3 matrix. 








We want to diagonalize the matrix if possible. 

Step 1: Find the characteristic polynomial 
The characteristic polynomial p{t) of A is 

p(t ) = det(^ — tl) 


4 -t -3 -3 

3 -2 -t -3 

-1 1 2-t 


Using the cofactor expansion, we get 

p{t) = -{t-lf{t-2). 

Step 2: Find the eigenvalues 

From the characteristic polynomial obtained in Step 1, we see that eigenvalues are 

X — 1 with algebraic multiplicity 2 


and 


X — 2 with algebraic multiplicity 1. 

Step 3: Find the eigenspaces 

Let us first find the eigenspace E \ corresponding to the eigenvalue A — 1. 
By definition, E\ is the null space of the matrix 


' 3 

-3 

-3' 


'1 

-1 

- 1 " 

3 

-3 

—3 


0 

0 

0 

.-1 

1 

1 _ 


.0 

0 

0 . 


by elementary row operations. 

Hence if (A — I)x — 0 for x E 1R 3 , we have 

x\ = x 2 +x 3 . 


Therefore, we have 


Ei 


AT (A -1) = 


x G R 3 



v 


' 1 ' 

1 

X = x 2 

1 

+ X3 

0 

l 


.0. 


.1. 

i 


From this, we see that the set 


f 

' 1 " 


" 1 " 

\ 

I 

1 

5 

0 

> 

l 

. 0 . 


_ 1 _ 

> 




















is a basis for the eigenspace E \. 

Thus, the dimension of E \ , which is the geometric multiplicity of A — 1, is 2. 


Similarly, we find a basis of the eigenspace E 2 = Af (A — 21) for the eigenvalue >1 = 2. 
We have 



" 2 

-3 

—3" 



'1 

0 

3' 

A -21 = 

3 

-4 

-3 

—► * ' 

• * —» 

0 

1 

3 


-1 

1 

0 



0 

0 

0 


by elementary row operations. 

Then if (A — 21) x = 0 for x E IR 3 , then we have 


x\ — —3x3 andx 2 = —3x3. 


Therefore we obtain 


E 2 = Af (A -2 1)= { x E 


x =x 3 


-3 

-3 

1 


From this we see that the set 

-3 

-3 

1 . 

is a basis for the eigenspace E 2 and the geometric multiplicity is 1. 

Since for both eigenvalues, the geometric multiplicity is equal to the algebraic multiplicity, the matrix A 
not defective, and hence diagonalizable. 


Step 4: Determine linearly independent eigenvectors 
From Step 3, the vectors 



"1" 


'1" 


'-3' 

Vl = 

1 

,v 2 = 

0 

,v 3 = 

-3 


0 


1 


1 


are linearly independent eigenvectors. 

Step 5: Define the invertible matrix S 
Define the matrix S = [V 1 V 2 V 3 ] . Thus we have 


5 = 


1 

1 

0 


1 

0 

1 


-3 

-3 

1 


and the matrix S is nonsingular (since the column vectors are linearly independent). 



















Step 6: Define the diagonal matrix D 

Define the diagonal matrix 


D = 


1 0 
0 1 
0 0 


0 

0 

2 


Note that (1,1) -entry of D is 1 because the first column vector v i 
, that is, v i is an eigenvector corresponding to eigenvalue X — 1 . 
Similarly, the (2,2) -entry of D is 1 because the second column v 2 

The (3,3) -entry of D is 2 because the third column vector V 3 = 


1 

1 

0 . 

1 

0 

1 


of S is in the eigenspace E 1 


of S is in E 1 . 


3 


of S is in E 2 . 


1 


(The order you arrange the vectors v 1 , v 2 , v 3 to form S does not matter but once you made S , then the 
order of the diagonal entries is determined by S , that is, the order of eigenvectors in S .) 

Step 7: Finish the diagonalization 

Finally, we can diagonalize the matrix^ as 


S~ 1 AS = D, 

where 


'1 

1 

— 3 " 


'1 

0 

o' 

1 

0 

—3 

and D — 

0 

1 

0 

.0 

1 

1 . 


.0 

0 

2. 


(Here you don’t have to find the inverse matrix S 1 unless you are asked to do so.) 


Diagonalization Problems and Examples 

Check out the following problems about the diagonalization of a matrix to see if you understand the 
procedure. 

Problem. Diagonalize 

and compute A 100 . 


A = 


1 2 
4 3 

















For a solution of this problem and related questions, see the post “Diagonalize a 2 by 2 Matrix A and 
Calculate the Power A 100 


Problem. Determine whether the matrix 


A 


0 10 

-10 0 
0 0 2 


is diagonalizable. If it is diagonalizable, then find the invertible matrix S and a diagonal matrix D 
such that S ~ 1 AS — D . 


For a solution, check out the post “Diagonalize the 3 by 3 Matrix if it is Diagonalizable 11 . 


Problem. Let 


A = 


2 

-1 

-1 


-1 -1 

2 -1 

-1 2 


Determine whether the matrix A is diagonalizable. If it is diagonalizable, then diagonalize A . 


For a solution, see the post “Quiz 13 (Part 1) Diagonalize a matrix. 11 . 


Problem. Diagonalize the matrix 


A = 


1 1 
1 1 
1 1 


1 

1 

1 


In the solution given in the post “Diagonalize the 3 by 3 Matrix Whose Entries are All One 11 , we use an 

indirect method to find eigenvalues and eigenvectors. 

The next problem is a diagonalization problem of a matrix with variables. 


Problem. 

Diagonalize the complex matrix 

a 

Using the result of the diagonalization, compute A k for each k £ N. 
The solution is given in the post - * 

Diagonalize the Upper Triangular Matrix and Find the Power of the Matrix 


A = 


a b — 
0 b 










A Hermitian Matrix can be diagonalized by a unitary matrix 

Theorem. If A is a Hermitian matrix, then A can be diagonalized by a unitary matrix U . 
This means that there exists a unitary matrix U such that U~ l AU is a diagonal matrix. 

Problem. 

Diagonalize the Hermitian matrix 


by a unitary matrix. 


A = 


1 i 

—i 1 


The solution is given in the post i 

Diagonalize the 2x2 Hermitian Matrix by a Unitary Matrix 

More diagonalization problems 

More Problems related to the diagonalization of a matrix are gathered in the following page: 

Diagonalization of Matrices 




Suppose that A and P are 3 X 3 matrices and P is invertible matrix. 


If 


P~ X AP = 


12 3 " 

0 4 5, 

0 0 6 


then find all the eigenvalues of the matrix A 2 . 


Proof (short version). 

Let B = P ~ l AP . Since B is an upper triangular matrix, its eigenvalues are diagonal entries 1, 4, 6. 

Since A and B = P ~ l AP have the same eigenvalues, the eigenvalues of A are 1,4, 6. Note that these 
are all the eigenvalues of A since A is a 3 X 3 matrix. 

It follows that all the eigenvalues of A 2 are 1,4 2 ,6 2 , that is, 1,16, 36 . 

Proof (long version.) 

Let us put B := P~ l AP . 

The eigenvalues of B are 1,4, 6 since B is an upper triangular matrix and eigenvalues of an upper 
triangular matrix are diagonal entries. 

We claim that the eigenvalues of A and B are the same. 






To prove this claim, we show that their characteristic polynomials are equal. 
Let Pa ( t ) ,Pb ( t ) be the characteristic polynomials of A,B , respectively. 
Then we have 


Ps(t) = det (B - tl) = det (P~ l AP - tl) 
= det(P _1 (^ - tI)P) 

= det(P _1 ) det(^ - tl) det(P) 

= det (P)~ 1 pA (t) det(P) 

= Pa (()• 


Therefore, the characteristic polynomials of A and B are the same, and hence the matrices A and B have 
the same eigenvalues. 

Thus 1, 4, 6 are eigenvalues of the matrix A and these are all the eigenvalues of A since a 3 X 3 matrix 
has at most three eigenvalues. 


Now we claim that in general if 1 is an eigenvalue of A , then X 2 is an eigenvalue of A 2 . We prove this 
statement below. Assuming this claim for the moment, we finish the problem. 

By the claim, the matrix A 2 has eigenvalues l 2 ,4 2 ,6 2 . Thus all the eigenvalues of A 2 are 

1,16,36. 


To complete the solution, let us prove the last claim. 

Since X is an eigenvalue of A , we have an eigenvector x such that 

Ax — Ax. 


Multiplying by A we obtain 

A 2 Tx — A(Ax) 
— A(Ax) 
— A 2 x 


and thus A 2 is an eigenvalue of A 2 . 


Let A = 


1 

2 


2 

1 


. Compute A n for any n G N. 


Plan. 


We diagonalize the matrix A and use this Problem. 





0 Find eigenvalues and eigenvectors of the matrix A . 


0 Diagonalize the matrix A . 


0 Use the result of this Problem. 


Proof. 

We first diagonalize the matrix A . We solve 

1 - A 


det(^ - XI) = 

Z i — A 

= (1 - X) 2 - 4 = X 2 - 2X - 3 = (X + 1)(A - 3) = 0 


and obtain the eigenvalues X — —1,3. 

To find an eigenvector x corresponding to X = — 1, we solve (A + I)x — 0 or 


'2 

2' 


"xi" 


'o' 

.2 

2. 


*2. 


.0. 


We obtain an eigenvector x = 


corresponding to X — — 1. 


Similarly, solving (A — 3/)y = 0 , we obtain an eigenvector y = 
corresponding to X — 3 . 


Thus the invertible matrix S — [x y] 


1 1 

-1 1 


S -'AS = 


-1 0 

0 3 


diagonalizes the matrix A , that is 
or equivalently A = S 


-1 0 

0 3 


S~'. 


Then for each n G N, we have 


A n = 5 


-1 0 
0 3 


= 5 


-1 0 
0 3 


s=S 


(- 1 )" 0 

0 3" 


' i r 

'(-1)" o' 

1 

'1 

-1' 

1 

■ + 3 n 

(- 1)” +1 + 3 " ' 

.-i i. 

.0 3\ 

2 

.1 

1 . 

= 2 

,(-l)" +1 + 3 " 

(_1)« + 3 n 


(See this problem for the details of these computations.) 
Therefore we obtained the formula 


^=2 


(—1)" + 3" (—1)" +1 + 3" ’ 

(-1) B+1 + 3" (-1)" + 3" 


Comment. 







































Another typical method to compute a power of a square matrix is mathematical induction. To use it, we 
need to first compute several small powers like A 2 and A 3 and guess the formula for A n . 

If you can guess the formula, then the mathematical induction part is not difficult. But for this specific 
problem, the formula is a bit complicated to guess as you can see from the solution above. Thus we used 
diagonalization trick. 


Let A , B be matrices. Show that if A is diagonalizable and if B is similar to A, then B is diagonalizable. 


Definitions/Hint. 

Recall the relevant definitions. 




Two matrices A and B are similar if there exists a nonsingular (invertible) matrix S such that 


S~ 1 BS = A. 


o 


A matrix A is diagonalizable if A is similar to a diagonal matrix. Namely, A is diagonalizable if 
there exist a nonsingular matrix S and a diagonal matrix D such that 


S~ 1 AS = D. 


Some useful facts are 


If S and T are invertible matrices, then we have 

(rsy 1 = s~ 1 t~ 1 . 

(Note the order of the product.) 




A matrix is nonsingular if and only if its determinant is nonzero. 


Proof. 

Since the matrix A is diagonalizable, there exist a nonsingular matrix S and a diagonal matrix D such that 

S~ l AS = D. (*) 

Also, since B is similar to A , there exist a nonsingular matrix T such that 

T 1 BT=A. (**) 

Inserting the expression ofv4 from (**) into the equality (*), we obtain 




D = S~ l (T~ l BT)S 
= (S~ 1 T- 1 )B(TS) 

= (TSy'BiTS). (***) 


Now let us put U TS . Then the matrix U is nonsingular. 
(This is because we have 


det (U) = det(rS) = det(r)det(5) ^ 0 
since T and S are nonsingular matrices, hence their determinants are not zero.) 
Therefore from (***) we have 

D = U~ l BU , 


where D is a diagonal matrix and U is a nonsingular matrix. 
Thus B is a diagonalizable matrix. 

0 (a) Is every diagonalizable matrix invertible? 

(b) Is every invertible matrix diagonalizable? 


Solution. 

The answer is No. 

Counterexample 

We give a counterexample. Consider the 2 X 2 zero matrix. 

The zero matrix is a diagonal matrix, and thus it is diagonalizable. 

However, the zero matrix is not invertible as its determinant is zero. 

More Theoretical Explanation 
Let us give a more theoretical explanation. 

If an n X n matrix A is diagonalizable, then there exists an invertible matrix P such that 

0 ••• 0 " 

A 2 * * * 0 

• . • 5 

0 ••• X n . 

where X \ ,..., X n are eigenvalues of A . 

Then we consider the determinants of the matrices of both sides. 

The determinant of the left hand side is 

det(P AP) — det(P) -1 det(^) det(P) = det(^). 


P-'AP = 


X 

o 



On the other hand, the determinant of the right hand side is the product 





since the right matrix is diagonal. 
Hence we obtain 


det(^4) = A 1 A 2 • • . 

(Note that it is always true that the determinant of a matrix is the product of its eigenvalues regardless 
diagonalizability. 

See the post “Determinant/trace and eigenvalues of a matrix 46 .) 

Hence if one of the eigenvalues of A is zero, then the determinant of A is zero, and hence A is not 
invertible. 

The true statement is: 


a diagonal matrix is invertible if and only if its eigenvalues are nonzero. 


Is Every Invertible Matrix Diagonalizable? 

Note that it is not true that every invertible matrix is diagonalizable. 
For example, consider the matrix 


A = 


1 

0 


1 

1 


The determinant of A is 1, hence A is invertible. 
The characteristic polynomial of A is 


p(t ) = det(^4 — tl ) = 


1 

1 - 


(l-O 2 . 


Thus, the eigenvalue of A is 1 with algebraic multiplicity 2. 
We have 


A — I = 


0 1 
0 0 


and thus eigenvectors corresponding to the eigenvalue 1 are 


a 


1 

0 


for any nonzero scalar a. 

Thus, the geometric multiplicity of the eigenvalue 1 is 1. 

Since the geometric multiplicity is strictly less than the algebraic multiplicity, the matrix A is defective and 
not diagonalizable. 


Is There a Matrix that is Not Diagonalizable and Not Invertible? 










Finally, note that there is a matrix which is not diagonalizable and not invertible. 

0 1 ' 


For example, the matrix 


0 0 


is such a matrix. 


Summary 

There are all possibilities. 


© Diagonalizable, but not invertible. 
Example: 


0 0 
0 0 


Invertible, but not diagonalizable. 
Example: 


1 1 
0 1 


© Not diagonalizable and Not invertible. 
Example: 


0 1 
0 0 


© Diagonalizable and invertible 
Example: 


1 0 
0 1 


© 


Determine whether the matrix A = 


0 1 0 
-10 0 


is diagonalizable. If it is diagonalizable, then find the 


0 0 2 . 

invertible matrix S and a diagonal matrix D such that S' -1 AS — D. 


How to diagonalize matrices. 

For a general procedure of the diagonalization of a matrix, please read the post “How to Diagonalize a 
Matrix. Step by Step Explanation* 6 . 















Solution. 

We first determine the eigenvalues of the matrix A . 

To do so, we compute the characteristic polynomial p(t) of A . 
We have 


p{t ) = det(^4 — tl) 
-t 10 
= -1 -t 0 
0 0 2 -t 


= (— 1) 3+3 (2 - t) 
= (2 - t)(t 2 + 1 ). 


by the third row cofactor expansion 


Thus the eigenvalues ofv4 are 2, zb/. 

Since the 3 X 3 matrix A has three distinct eigenvalues, it is diagonalizable. 


To diagonalize A , we now find eigenvectors. 
For the eigenvalue 2, we compute 


A-21 = 


R i <->R o 


'-2 

1 

O' 

R 2 

'-2 

1 

O' 

t —1 

1 

-2 

0 

- > 

1 

2 

0 

. 0 

0 

0. 


. 0 

0 

0. 


TR2 


12 0 ' 
-2 10 


R 2 +2Ri 


1 2 O' 
0 5 0 



. 0 

0 

0. 



.0 

0 

'1 

2 

O' 

Ri 

-2R 2 

'l 

0 

o' 

0 

1 

0 


- » 

0 

1 

0 

.0 

0 

0. 



.0 

0 

0. 


Thus, the solutions x of (A — 21) = 0 satisfy x = y = 0 
Hence the eigenspace is 


f 

'O' 

\ 

E 2 = M(A - 21) = Span 

0 

> 

l 

_ 1 _ 

J 


For the eigenvalue i, we compute 


—i 

1 


0 



' 1 

i 


0 





iR i 





-1 

—i 


0 



-1 

—i 


0 

0 

0 

2 

— i _ 



. 0 

0 

2 

— i 

R 2 
1 

+Ri 

J? „ 

'1 

i 

o ' 



'1 

i 

O ' 

2 - 

-R 3 








—> 

0 

0 

0 


- > 

0 

0 

1 



0 

0 

i . 



.0 

0 

0 . 





































So the solutions x of (A — il)x = 0 satisfy 

x = —iy andz = 0. 


Thus, the eigenspace is 


Ei — Af(A — il ) = Span 


Since i and —i are complex conjugate, their eigenspaces are also complex conjugate. 
Hence the eigenspace for —i is 



From these computations, we have obtained eigenvalues 2, /, —i and eigenvector corresponding to these 
are 


Let 


and 



'o' 


"1' 


" 1 ' 

V2 = 

0 

,v,- = 

i 

,V-f = 

— i 


l 


0 


0 


5 = 


V2 V; V_fJ = 


0 1 1 

0 i —i 

1 0 0 


£> = 


2 0 0 
0 i 0 
0 0 -i 


Then S is invertible and we have S 1 AS = D by the diagonalization process. 




For which values of constants a, b and c is the matrix A 
(The Ohio State University) 


7 a b 
0 2 c 
0 0 3 


diagonalizable? 


Solution. 

Note that the matrix A is an upper triangular matrix. 
Hence the eigenvalues of A are diagonal entries 7,2,3. 




















So the 3 X 3 matrix A has three distinct eigenvalues. 

This implies thatv4 is diagonalizable. 

Hence, regardless of the values of a, c, the matrix A is always diagonalizable. 
Thus, a,b,c can take arbitrary values. 


Let 



" 1 3 3 ' 


"2 4 3 

A = 

-3 -5 -3 

and B — 

-4 -6 -3 


3 3 1 


3 3 1 


For this problem, you may use the fact that both matrices have the same characteristic polynomial: 

Pa (A) = Ps (A) = — (A — 1) (A + 2) 2 . 


(a) Find all eigenvectors of A. 

(b) Find all eigenvectors of B. 

(c) Which matrix A or B is diagonalizable? 

(d) Diagonalize the matrix stated in (c), i.e., find an invertible matrix P and a diagonal matrix D such that 
A = PDP - 1 or B = PDP- 1 . 

(Stanford University) 


Hint. 

See the post how to diagonalize a matrix for a review of the diagonalization process. 

Solution. 

(a) All eigenvectors of A 

The eigenvalues of A are roots of the characteristic polynomial (A) , hence the eigenvalues of A are 

A = 1,-2 


with algebraic multiplicities 1 and 2, respectively. 

Let us first find the eigenvectors corresponding to the eigenvalue X = — 2 . 

The eigenvectors are nonzero solutions of the equation 

(A + 2/)x = 0, 

q 

where / is the 3 X 3 identity matrix, x £ R , and 0 is the three dimensional zero vector. 

To solve the linear system, we use the Gauss-Jordan elimination method. 

Applying elementary row operations we obtain 


‘ 3 

3 

3 ' 



"1 

1 

1 

-3 

-3 

-3 

-)► ‘ ' 

■ * 

0 

0 

0 

. 3 

3 

3 . 



.0 

0 

0. 


Therefore, the vector x = 


xi 

*2 

_*3_ 


must satisfy 













Xi = X 2 x 3 . 

Hence the eigenvectors associated to /l = — 2 are 



1 

1 

X 

to 

1 

X 

CO 


'- 1 " 


'-1 

X = 

1 

X X 

CO to 

= X 2 

1 

. 0 . 

+ x 3 

0 

. 1 . 


for any pair of complex numbers (x i, x 2 ) 7 ^ ( 0 , 0 ). 


Now we find the eigenvectors corresponding to the eigenvalue X = 1. 
As above, we solve the system 

(A - /)x = 0 . 

By elementary row operations, we reduce the matrix A — I and obtain 


I 

II 

‘ 0 

-3 

3 3 ' 

-6 -3 

-A • • • -A 

"1 0 

0 1 

-1' 

1 


3 

3 0 


0 

0 

0 


Hence the vector x should satisfy 

x 1 — x 3 andx 2 = — x 3 . 


Thus the eigenvectors associated to X — 1 are 


x = x 3 


for any nonzero complex number x 3 . 

(b) All eigenvectors of B 

The eigenvalues of B are roots of the characteristic polynomial ps{i), hence the eigenvalues of B are 


A = 1,-2 


with algebraic multiplicities 1 and 2, respectively. 


Let us first find the eigenvectors corresponding to the eigenvalue X = — 2 . 
We need to solve the sysetm 

(.B + 2/)x = 0. 


By elementary row operations, we have 



' 4 

4 

3 ' 



"1 

1 

o' 

B + 21 = 

-4 

-4 

-3 

—> ‘ ' 

• * —> 

0 

0 

1 


3 

3 

3 



0 

0 

0 


















Thus the vector x satisfies 


x\ — —X2 and x 3 = 0, 


hence the eigenvectors associated to A — —2 are 


x = x 2 


-1 

1 

0 


for any nonzero complex number x 2 


Next, we find the eigenvectors corresponding to eigenvalue >1=1. 
Since we have 



' 1 

4 

3 ' 



"1 

0 

-i" 

5-/ = 

-4 

-7 

-3 

-A • - 

• • -A 

0 

1 

1 


3 

3 

0 



0 

0 

0 


by elementary row operations, the system ( B — /)x = 0 has solutions 

jc 1 = X3 andx2 = — X3. 

Thus the eigenvectors associated to A — 1 are 


x = x 3 


1 

-1 

1 


for any nonzero complex number x 3 . 

(c) Which matrix A or B is diagonalizable? 

From the result of (a), for each eigenvalue A of A the geometric multiplicity of A is the same as the 
algebraic multiplicity of A . Thus A is not defective, and hence A is diagonalizable. 

From the result of (b), the geometric multiplicity of the eigenvalue A — —2 of B is 1 although the 
algebraic multiplicity of A — — 2 is 2. Thus the matrix B is defective and hence B is not diagonalizable. 


(d) Diagonalize the matrix 

We diagonalize the matrix A . From the computation of (a), the vectors 


'-1" 



1 

5 

0 

. 0 . 


. 1 . 


form a basis of the eigenspace E -2 associated to A — — 2 . 
Also the vector 














1 

-1 
_ 1 

form a basis of the eigenspace E i associated to A — 1. 
Therefore the vectors 



are linearly independent eigenvectors of A . 
Thus we set 


p = 

'-1 

1 

-1 1 ■ 

0 -1 

and D — 

'-2 

0 

0 O' 
-2 0 


0 

1 1 


0 

0 1 


and we have 


P ~ X AP = D. 




Consider the matrix A 


a 

b 


-b 

a 


, where a and b are real numbers and b ^ 0. 


(a) Find all eigenvalues of A. 

(b) For each eigenvalue of A, determine the eigenspace E\. 

(c) Diagonalize the matrix A by finding a nonsingular matrix S and a diagonal matrix D such that 

S~ 1 AS = D. 


Solution. 

(a) Find all eigenvalues of A . 

The characteristic polynomial p(t) of the matrix A is 


p(t ) = det(^4 — tl ) = 


-b 
a — t 


— (a — t ) 2 + b 2 . 


The eigenvalues ofv4 are roots of p(t). 
So we solve p(t) — 0 . We have 


(a — t) 2 + b 2 — 0 
<^> [a — t) 2 = —b 2 
<^> a — t = ±ib 
<^> t — a ± ib. 


Here i — y/— 1. 

Thus, the eigenvalues ofv4 are a zL ib. 

(b) For each eigenvalue of A , determine the eigenspace Ex . 



















We first determine the eigenspace Ex for X — a + ib . 

Recall that by definition Ex — AT {A — U) , the nullspace of A — XI . 


We compute 


A — (a + ib)I — 


—ib 

-b 1 


n 

—i 

R 2 —bR i 

1 —i 

i 

T 

-* 


i 

T 

-» 

i 

o 

o 


Note that in the above row reduction, we needed the assumption b ^ 0. 

It follows that the general solution of the system is x i = ix 2 . 

Hence, we have 

E a +ib = Span 



Note that the other eigenvalue a — ib is the complex conjugate of a + ib. 

It follows that the eigenspace E a -ib is obtained by conjugating the eigenspace E a +ib • 
Thus, 

E a -ib = Span 

(c) Diagonalize the matrix^ 



From part (b), we see that 

2 

form an eigenbasis for C . 
So, we set 


i 


—i 

and 

.1. 


. 1 . 


i 

—i 

and D — 

a + ib 0 

l 

1 . 


0 a — ib _ 


and we obtain S 1 AS — D by the diagonalization procedure. 




Determine all 2 X 2 matrices A such that A has eigenvalues 2 and 


— 1 with corresponding eigenvectors 


and 


2 

1 


, respectively. 


respectively. 

Solution. 

Suppose that A is a 2 X 2 matrix having eigenvalues 2 and —1 with corresponding eigenvectors 


























respectively. 

Then since A has two distinct eigenvalues, the matrix^ is diagonalizable. 
As we know eigenvectors, we can diagonalize A by the matrix 


S 


That is, we have 


S' 1 AS = 


The inverse matrix of S is given by 


S” 1 = 


1 2 
0 1 


2 0 

0 -1 


1 -2 

0 1 


A =S 


S 1 

1 -2 

0 1 


It follows that we have 

1 -2 

0 1 

2 0 

0 -1 

2 -6 
0 1 

Therefore, the only matrix satisfying the given conditions is 

^ 2 “ 6 

0 1 


1- 

'1 -2' 

-1 

-1 

T—1 

O 


$ Let A and B be n X n matrices. Suppose that A and B have the same eigenvalues Ai,..., A n with the same 
corresponding eigenvectors xi,..., x n . Prove that if the eigenvectors Xi,. . . , x n are linearly independent, then 

A = B. 

Proof. 

Since A and B have n linearly independent eigenvectors Xi,..., x n , they are diagonalizable. 
Specifically, if we put S = [xi,..., x n ]. 

Then S is invertible (as column vectors of S are linearly independent) and we have 

S~ l AS = D and S l BS = D, 


where D is the diagonal matrix whose diagonal entries are eigenvalues: 


D = 


Ai 

0 


0 

A 2 


0 

0 


0 0 •• • A n 


It follows that we have 


























S-'AS = D = S^BS, 

and hence A — B. This completes the proof. 


Suppose that A is a diagonalizable n X n matrix and has only 1 and — 1 as eigenvalues. Show that A 2 = I n , 
where I n is the n X n identity matrix. 

(Stanford University) 

Hint. 

Diagonalize the matrix A so that the diagonal matrix has ±1 on diagonal entries. 

Proof. 

Since A is diagonalizable, there exists an invertible matrix P such that P 1 AP — D , where D is a 
diagonal matrix. 

Since A has only ±1 as eigenvalues, we can choose P so that the diagonal entries of D are either 
± 1 . 

Then we have A = PDP~ l and 

A 2 = (PDP~ 1 )(PDP~ 1 ) = (PD/^^/DP- 1 ) = PD 2 P \ 

Note that D 2 = I 2 and thus we have A 2 = I 2 as required. 

Generalization 

As a generalization, consider the following problem. 


Problem. 

Suppose that A is a diagonalizable n X n matrix and the eigenvalues of A are r-th roots of 
unity for some positive integer r. 

Show that A r — I n , where I n is the n X n identity matrix. 


The proof of Problem 2 is a straightforward generalization of the proof of Problem 1. So give it a shot. 


Let 



be 3 X 3 matrix. Find 


lim A n . 

n—>00 


(Nagoya University Linear Algebra Exam) 




o The matri xA is symmetric, hence diagonalizable. 


£> Diagonalize A . 


You may want to find eigenvalues 

without computing the characteristic polynomial for A . 


Solution. 

Note that the matrix A is symmetric, hence it is diagonalizable. 
We observe several things to simplify the computation. 


First, note that the sum of the entries in each row is 1. 

(Such a matrix is call a stochastic matrix. See the comment below.) 


Thus the matrix A has eigenvalue 1 and 


1 

1 

1 


is an eigenvector. 


Also note that if we add 2/7(= —X) to diagonal entries, then every entry becomes 3/7. 
That is, 


A + 




3 " 

7 

3 

7 

3 

7- 


5 


which is clearly singular. 

From this observation, we notice that — 2/7 is an eigenvalue 



' 1" 


' 0' 

ofxi + x2 + x 3 = 0. Therefore 

-1 

and 

1 


0 


-1 


of A and eigenvectors are nonzero solution 
linearly independent eigenvectors for the 


eigenvalue —2/7. 

Hence the geometric (and hence algebraic) multiplicity of the eigenvalue —2/7 is 2. 

(Note that we found all eigenvalues of A without actually computing the characteristic polynomial.) 


1 

1 

1 


0 

1 

-1 


Now the invertible matix P = 


1 

-1 

0 


diagonalize A . Namely we have 

















Hence 


Therefore we have 


P~ l AP = 


A — P 


10 0 
0 -2/7 0 

0 0 —2/7 


0 

-2/7 

0 


0 
0 

-2/7. 



- V 

0 

0 


'i 

0 

o' 

A" =P 

0 

(-2/7 r 

0 

P 1 and lim A n =P 

0 

0 

0 


_ 0 

0 

(-2/7)”. 

n—yoo 

.0 

0 

0. 


, -1 


Find the inverse P 1 by your favorite method 


P~ l = 


1 

3 

2 

3 

1 

L 3 


Then the answer is 


lim A n = 

«—>■ oo 


-2 

3 J 


Comment. 

You might come up with the idea to use the diagonalization as it is an exam problem in linear algebra. 

But you soon realize that computing the characteristic polynomial for A is a bit messy. 

So sometimes it is important to find eigenvalues without finding the roots of the characteristic polynomial. 

We observed that 


O the sum of row entries of A is 1 


O if we add (2/7)7 to A then we obtain a matrix whose entries are all identical. 


These observation yields the eigenvalues of A without using the characteristic polynomial. 


If the sum of entries of each row of a matrix is 1, then the matrix is called called a stochastic matrix (or 
Markov matrix, probability matrix). 

The matrix A in the problem is an example of a stochastic matrix. 














Stochastic matrices have always 1 as an eigenvalue, 
see the post 

Stochastic matrix (Markov matrix) and its eigenvalues and eigenvectors 
for an example of a 2 X 2 stochastic matrix. 


Let 

"0 0 f 

10 0 . 

.0 1 0 . 

(a) Find the characteristic polynomial and all the eigenvalues (real and complex) of A. Is A diagonalizable over 
the complex numbers? 

(b) Calculate ^4 2009 . 

(Princeton University) 


Solution. 

(a) The characteristic polynomial and the eigenvalues 

The characteristic polynomial p[t) of the matrix A is the determinant of A — tl . We compute 
p{t) — det(^4 — tl) as follows. 

p{t) = det(^4 — tl) 

-t 0 1 

= 1 -f 0 

by the first row cofactor expansion 

Thus, the characteristic polynomial of the matrix A is 

p{t) — —V* + 1. 

The eigenvalues of the matrix A is roots of the characteristic polynomial. 

Hence solving — T 3 + l = 0,we obtain 

-1 ± y/Zi 

’ 2 

and these are all eigenvalues of A . 

The matrix A is a 3 X 3 matrix, and hence it has at most three distinct eigenvalues, and 

we found three distinct eigenvalues. In general, if an n X n matrix has n distinct eigenvalues, the matrix 

is diagonalizable. Thus the matrix A is diagonalizable. 

(b ) A 2009 

By direct computations, we have 














"o o r 


"0 0 1" 


"0 1 o' 

10 0 


10 0 

= 

0 0 1 

.0 1 0. 


.0 1 0. 


.1 0 0. 



'0 0 1 


'0 1 o' 

CO 

II 

to 

II 

10 0 


0 0 1 


0 10 


10 0 


where I is the 3 X 3 identity matrix. 

Therefore noting that 2009 = 3 • 669 + 2 , we have 

2009 3-669+2 _ ^3)669^2 

= 1 669 A 2 =A 2 . 


Therefore, we obtain 


A 


2009 


0 1 0 
0 0 1 
1 0 0 


Another solution using diagonalization 


Here, I give another solution for (b) using the diagonalization of the matrix A . 

For this particular matrix A , the above solution is easier since the power of A has a simple pattern. 

The following computation is lengthy, but I give it for a pedagogical reason. 

I just outline the solution and omit the detail computations. 

We obtained eigenvectors k — 0,1, 2 , where C is a primitive third root of unity in (a). 

" ^ 2 k' 


Eigenvectors are 


c* 

i 

for each eigenvalue . 
Then let 


5 = 


be the matrix whose columns are eigenvectors. 


It has the inverse 




c 

c c 2 

1 1 


S - 1 = 


1 1 

c c 2 
ic 2 c 


Then the matrix S diagonalize the matrix A and we obtain 

S^AS = 


10 0 
o c o 
Lo o c 2 J 






















Then we compute 



'1 

0 

0 




5 

0 

£-2009 

0 

s~ 

1 



.0 

0 

£- 2 ( 2009 ) 





"1 

0 

o' 


'0 

1 

o' 

5 

0 

to 

O 

S- 1 = 

0 

0 

1 


.0 

0 

C. 


1 

0 

0. 


© 


Let A be an n X n matrix with real number entries. Show that if A is diagonalizable by an orthogonal matrix, 
then A is a symmetric matrix. 


Proof. 

Suppose that the matrix A is diagonalizable by an orthogonal matrix Q . 
The orthogonality of the matrix Q means that we have 

Q t Q = QQ t =i, 

where Q T is the transpose matrix of Q and I is the n X n identity matrix. 


(*) 


Since Q diagonalizes the matrix A , we have 


Q~ l AQ = D , 


where D is a diagonal matrix. 

Equivalently, we have 

A = QDQ- 1 . (**) 

Taking transpose of both sides, we obtain 

A T = (QDQ - 1 ) T 

Moivr 1 

= (Q 1 ) t DQ 1 since D is diagonal. (***) 


By (*), we observe that the inverse matrix of Q is the transpose , that is, Q 1 = . 

It follows from this observation and (***) that we have 

A T = QDQ \ 

(Note that (Q~ 1 ) T = Q TT = Q-) 

Comparing this with (**), we obtain 

A t =A, 


and hence A is a symmetric matrix. 








Let 


A = 



-1 -1 

2 -1 

-1 2 


Determine whether the matrix A is diagonalizable. If it is diagonalizable, then diagonalize A. 


Solution 1. 


We claim that the matrix A is diagonalizable. 

One way to see this is to note that A is a real symmetric matrix, and hence it is diagonalizable. 


Alternatively, we can compute eigenspaces and check whether A is not defective (namely, the 
algebraic multiplicity and the geometric multiplicity of each eigenvalue of A are the same. 


To diagonalize the matrix A, we need to find eigenvalues and three linearly independent eigenvectors. 
We compute the characteristic polynomial of A as follows: 


pit) = 

det (A 

-tl) 




'2 -t 

-1 -1 1 



= 

-1 

2 -t -1 




-1 

-1 2 — t 





"2 -t -1 ■ 


“-1 -1 ■ 


■-1 2 -r 

= (2 - 1 ) 

-1 2 — t 

-(- i ) 

-1 2 — t 

+(- 1 ) 

-l -l 


(by the first row cofactor expansion) 
= - t(t - 3) 2 . 


Since eigenvalues are the roots of the characteristic polynomial, eigenvalues of A are 0 and 3 with 
algebraic multiplicity 1 and 2, respectively. 

(If you did not confirm that A is diagonalizable yet, then at this point we know that the geometric 
multiplicity of X = 0 is 1 since the geometric multiplicity is alway greater than 0 and less than or equal 
to the algebraic multiplicity. However the geometric multiplicity of 2 = 3 is either 1 or 2.) 


Next, we determine the eigenspace and its basis for each eigenvalue ofv4. 

For 2 = 3, we find solutions of (A - 37)x = 0. 

We have 


■-1 

-l 

-l 1 

K 2 Aj 

*3"*1 

‘-I 

-1 

-1 1 

-*1 

"1 

1 

1 ■ 

-l 

-l 

-l 


0 

0 

0 


0 

0 

0 

-l 

-l 

-l 


0 

0 

0 


0 

0 

0 


Hence a solution must satisfy x l = ~x 2 ~ x 3 , and thus the eigenspace is 




















"-1 ' 


■-1 1 


X e R 3 

<N 

X 

II 

M 

1 

+ X 3 

0 

for any x 2 , x 3 6 C \ 



0 


1 



From this expression, it is straightforward to check that the set 



is a basis of Ey 

(Hence the geometric multiplicity of X = 3 is 2, and A is not defective and diagonalizable.) 


For a = 0, we solve (A - 0I)x = 0, thus Ax = 0. 
We have 



' 2 

-1 

-1 ' 

R 



-1 

2 

-1 1 


' 1 

-2 

1 

■ 

A = 

-1 

2 - 

-1 


-> 


2 

-1 

-1 


2 

-1 

-1 


-1 

-1 

2 




-1 

-1 

2 


-1 

-1 

2 



2/?, 














Z 1 

R 3 +R i 

' 1 -2 

1 




•1 

-2 

1 " 

3 R 2 

-1 

-2 

1 1 



0 3 

- 

3 



0 

3 

-3 

-> 

0 

1 

-1 




o 

1 

3 




0 

0 

0 


0 

0 

0 



ri 0 -1 
0 1 -1 
0 0 0 


Hence any solution satisfies 


Therefore, the eigenspace is 


Xj = x 3 andx 2 = Xy 


E 0 = < x £ R 3 


for any x 3 6 C >, 


and a basis of E 0 is 



Let 































■-1 - 


■-1 ■ 


"1 " 

U 1 = 

1 

0 

, U 2 = 

0 

1 

U 3 = 

1 

1 


Note that {u 1? u 2 } is a basis of E 3 and {u 3 } is a basis of E 0 . 

Thus, it follows that the vectors u 1? u 2 , u 3 are linearly independent eigenvectors. 
Put 


S= [u 1 ? u 2 ,u 3 ] = 


"-1 

1 

0 


-1 

0 

1 


1 ' 

1 

1 


Since the columns of S are linearly independent, the matrix S is nonsingular. Then the procedure of 
the diagonalization yields that 


r3 


S~ l AS = 


0 

0 


0 

3 

0 


0 

0 

0 


where diagonal entries are eigenvalues corresponding to the eigenvector u 1? u 2 , u 3 with this order. 


Solution 2. 

The second solution uses a different method to find eigenvalues and eigenvectors. 

Let B = A - 37. Then every entry of B is -1. 

We find eigenvalues X and eigenvectors of B. Then the eigenvalues of A are X + 3 and eigenvectors are 
the same. 

First we reduce the matrix B as follows: 



‘-I 

-1 

-1 1 

K 2 K l 

R 3~Ri 

"-1 

-1 

-1 ■ 

-*1 

■1 

1 

1 1 

B = 

-1 

-1 

-1 

-► 

0 

0 

0 


0 

0 

0 


-1 

-1 

-1 


0 

0 

0 


0 

0 

0 


Hence the rank of B is 1, and the nullity of B is 2 by the rank-nullity theorem. It follows that X = 0 is 
an eigenvalue of B. 

Note that the eigenspace E 0 corresponding to X = 0 is the null space of A From the reduction above, 
we see that the null space consists of vectors 



'-1 1 


■-1 ■ 

<N 

X 

II 

M 

1 

0 

+ X 3 

0 

1 


for any complex numbers x 2 , x 3 . 
It follows that 






















■-1 ■ 


■-1 1 

1 


0 

0 


1 


are basis vectors of eigenspace E 0 . Hence the geometric multiplicity of X = 0 is 2. 
(The algebraic multiplicity is either 2 or 3. We will see it must be 2.) 


Since all the entries of B are -1, by inspection, we find that the vector 

"1 ' 

1 

1 

is an eigenvector corresponding to the eigenvalue - 3. 

In fact, we have 



■1 1 


"-1 

-1 

-1 1 


"1 1 


,_ 3 . 


■1 1 

B 

1 

= 

-1 

-1 

-1 


1 

= 

-3 

= -3 

1 


1 


-1 

-1 

-1 


1 


-3 


1 


Since the algebraic multiplicity of X = 0 is either 2 or 3, and the sum of all the algebraic multiplicities 
is equal to 3, the algebraic multiplicity of X = - 3 must be 1 and that of X = 0 is 2. 

Hence the geometric multiplicity of X = - 3 is 1. Thus 


ri 1 


1 

1 


is a basis vector of E_$. 


In a nutshell, we have obtained that eigenvalues of B are 0 and -3 and basis vectors of E 0 and E_^ 
are 


"-1 1 


"-1 1 


"1 ' 

1 


0 

and 

1 

0 


1 


1 


respectively. 

Since A = B + 37, the eigenvalues of A are 0 + 3 = 3 and -3 + 3 = 0 and the corresponding 
eigenvectors are the same. Thus 


B -1 1 


"-1 1 


■1 ' 

1 

? 

0 

and 

1 

0 


1 


1 
































are the basis vectors of the eigenspace E 3 and Eq of A, respectively. 


As in solution 1, we put 


5 = 


"-1 

1 

0 


-1 

0 

1 


1 ' 

1 

1 


Then we have 


r3 


S~ l AS = 


0 

0 


0 0 - 
3 0 

0 0 


where diagonal entries are eigenvalues corresponding to the eigenvector u 1? u 2 , u 3 with this order. 


© Let A be an n X n matrix with the characteristic polynomial 

Assume that the matrix A is diagonalizable. 

(a) Find the size of the matrix A. 

(b) Find the dimension of the eigenspace E2 corresponding to the eigenvalue A = 2. 

(c) Find the nullity of A. 

(The Ohio State University) 


Hint/Definition. 


© Recall that when a matrix is diagonalizable, the algebraic multiplicity of each eigenvalue is 
the same as the geometric multiplicity. 


© The geometric multiplicity of an eigenvalue A is the dimension of the eigenspace 
E\ — Af (A — XI) corresponding to A. 


© The nullity of A is the dimension of the null space Af(A) of A. 


Solution. 

(a) Find the size of the matrix A. 

In general, if A is an n X n matrix, then its characteristic polynomials has degree n. 

Since the degree of p(t) is 14, the size of A is 14 X 14. 

(b) Find the dimension of the eigenspace E 2 corresponding to the eigenvalue A = 2. 








Note that the dimension of the eigenspace E 2 is the geometric multiplicity of the eigenvalue A = 2 
by definition. 

From the characteristic polynomial p(t), we see that A = 2 is an eigenvalue of A with algebraic 
multiplicity 5. 

Since A is diagonalizable, the algebraic multiplicity of each eigenvalue is the same as the geometric 
multiplicity. 

It follows that the geometric multiplicity of A = 2 is 5, hence the dimension of the eigenspace E 2 is 
5. 

(c) Find the nullity of A. 

We first observe that A = 0 is an eigenvalue of A with algebraic multiplicity 3 from the 
characteristic polynomial. 

By definition, the nullity of A is the dimension of the null space Af(A), and furthermore the null 
space Af(A) is the eigenspace Eq. 

Thus, the nullity of A is the same as the geometric multiplicity of the eigenvalue A = 0. 

Since A is diagonalizable, the algebraic and geometric multiplicities are the same. Hence the nullity 
of A is 3. 


0 Let A be an n X n real symmetric matrix whose eigenvalues are all non-negative real numbers. Show that there is 
an n X n real matrix B such that B 2 — A. 



0 


(a) Consider the eigenvalues and the Jordan canonical form of A 


(b) Diagonalize the matrix A 


Solution. 

(a) Does there exist a 2 X 2 matrix^ with A 3 = O but A 2 7 ^ Ol 
We claim that there is no such 2x2 matrix A . 

Suppose that we have A 3 = O . This implies that all the eigenvalues of A are zero. 

To see this, let X be an eigenvalue of A and let x be a corresponding eigenvector. Then applying the 
defining relation Ax = Xx successively, we have 

0 — A 3 x — XA 2 x — X 2 Ax = X 3 x. 

Thus, the eigenvalue X must be zero since x is a nonzero vector since it is an eigenvector. 

Therefore, there exists an invertible matrix P such that the Jordan canonical form of A is 




P- X AP = 
Then A — P 


0 a 

0 0. 

0 a 

0 0 


for some number a. 


P 1 and we have 


A 2 = P 


'0 

a 

P 0 

( 

'0 

a 

A 


'0 

a 



\ p 



P 

= P 

.0 

0. 

) 

V 

.0 

0. 

) 


.0 

0. 


= P 


0 0 
0 0 


P 1 = 0. 


Hence A 2 must be zero, and we conclude that there is no 2 X 2 matrix A satisfying both A 3 — O and 
A 2 + O. 


(b) Does there exist a 3 X 3 real matrix B such that B 1 - A ? 

The answer is yes and we will construct such a matrix B as follows. 

The characteristic polynomial p(t) of the matrix A is 

p(t) det(^4 — tl ) = —t{t — l){t — 3). 

Thus the eigenvalues are X — 0,1,3 . 

The corresponding eigenvalues are obtained by solving the system (A — a/)x = 0 and we see that 


V 


' 1 ' 


‘ 1 ' 

i 

5 

0 

5 

-2 

_i. 


_-l_ 


_ 1 _ 


are eigenvectors corresponding to eigenvalues 0,1,3, respectively. 
Thus the matrix 

1 


P = 


1 1 
10-2 
1-11 


is invertible and diagonalize the matrix A such that 


P~ l AP = 


0 0 0 
0 10 
0 0 3 


Then if we have the equality B 2 = A , then we need to have 
(P 1 BP) (P 1 BP) =p~ X AP = 


0 0 0 
0 10 
0 0 3 


From this we see that the matrix 


B = P 


0 0 0 
0 10 
.0 0 Vs. 


, -1 




























satisfies the above relation. 


In fact, we compute 


B 2 = 


0 

0 

0 ' 

\ 


0 

1 

0 

p ~ 1 

p 

0 

0 

A. 

) 

. 


0 0 
0 1 
0 0 


0 
0 
AJ 



'0 

0 

0 ' 

2 

'0 

0 

o' 

p 

0 

1 

0 

p 1 = p 

0 

1 

0 


.0 

0 

vT 


.0 

0 

3. 


i -1 


, -1 


= A. 


Remark that the matrix B is real since P is real. Thus the matrix B satisfies the conditions of the problem. 
Remark. 

The explicit form of the matrix P was used to assure that the matrix B is real. 

So we did not have to compute the matrix P explicitly if we use the fact that the symmetric matrix is 
diagonalizable by a real orthogonal matrix. 

Generalization. 

See problem A square root matrix of a symmetric matrix with non-negative eigenvalues for a more general 
question than part (b). 


Furthermore, try the next problem. 




Find a square root of the matrix 


A = 


0 0 


-3 

5 

9 


Flow many square roots does this matrix have? 

(i University of California, Berkeley) 

Proof. 

We will find all matrices B such that B 2 = A. Such matrices B are square roots of the matrix A. 


Note that since A is a diagonal matrix, the eigenvalues of A are diagonal entries 1,4,9. Since A has 
three distinct eigenvalues, it is diagonalizable. 

Solving (.A — A/)x = 0 for A = 1,4,9, we find eigenvectors corresponding to eigenvalues 1,4,9 
are respectively 


'1" 


"1" 


"o' 

0 

5 

1 

? 

1 

.0. 


.0. 


_1_ 


Thus the invertible matrix 


P = 


1 1 
0 1 
0 0 


0 

1 

1 





















diagonalizes the matrix A , that is, we have 


P~ 1 AP = 


10 0 ' 
0 4 0 
0 0 9 


Then if B 2 = A, then we have (P 1 BP) (P 1 B) = P 1 AP. 

Let A' = P 1 AP and B' = P l BP. 

Since we have B ' 2 — A' , we have B' A' — B ' 3 — A'B'. 

Since A' is diagonal with distinct diagonal entries, this implies that B' is also a diagonal matrix. 


A diagonal matrix B' satisfying B ' 2 = A! = 


10 0 
0 4 0 
0 0 9 


is one of 


Hence B must be one of 


±10 0 
0 ±2 0 
0 0 ±3 


±10 O' 
0 ±2 0 
0 0 ±3 


P 1 . 


The inverse matrix of P can be calculated as 


P 1 

\ here tore, all the square roots of the matrix A are 


1-11 
0 1 -1 

0 0 1 


"1 

1 

o' 


0 

1 

1 


.0 

0 

1 . 



±10 0 
0 ±2 0 
0 0 ±3 


1 -1 

0 1 

0 0 


1 

-1 

1 


and we have 8 square root matrices. 


For example, when the diagonal matrix has all positive entries, then one of the square roots is 


1 

1 

o' 


'1 

0 

o' 


'1 

-1 

1 " 


'1 

1 

-1" 

0 

1 

1 


0 

2 

0 


0 

1 

-1 

= 

0 

2 

1 

0 

0 

1 


0 

0 

3 


0 

0 

1 


0 

0 

3 


Related Question. 

Problem. 

Prove that a positive definite matrix has a unique positive definite square root. 


























Suppose the following information is known about a 3 X 3 matrix A. 



1 


1 


1 


1 


2 


1 

A 

2 

= 6 

2 

, A 

-1 

= 3 

-1 

, A 

-1 

= 3 

-1 


1 


1 


1 


1 


0 


1 


(a) Find the eigenvalues of A. 

(b) Find the corresponding eigenspaces. 

(c) Is A a diagonalizable matrix? Is A an invertible matrix? Is A an idempotent matrix? 
(Johns Hopkins University) 


Solution. 

(a) Find the eigenvalues of A. 

From the first and the second equations, we see that 6 and 3 are eigenvalues of A. 
From the second and the third equations, we have 



' 1 ' 


' 1 " 


" 2 " 

A 

-1 

= 3 

-1 

= A 

-1 


1 


1 


0 


and thus we have 


'O' 

1 

■ i ■ 


' 2 ' 

\ 

'-1" 

0 

=4 

-i 

— 

-1 

= A 

0 

.0. 

\ 

. i _ 


. 0 . 

) 

. 1 _ 


This yields that 



"-1" 


"-1" 

A 

0 

= 0 

0 


1 


1 


(*) 


and we conclude that 0 is an eigenvalue. 

Since the size of the matrix A is 3 X 3, it has at most three eigenvalues. Thus we found them all. 
Eigenvalues of A are 0, 3, 6. 


(b) Find the corresponding eigenspaces. 

is an eigenvector corresponding to the eigenvalue 6. 


From the first equation, the vector 


1 

-1 

1 


We also see from the second equation, the vector 
eigenvalue 3. 

The equation (*) in the solution (a) also tells us that the vector 
corresponding to the eigenvalue 0. 


is an eigenvector corresponding to the 


-1 

0 

1 


is an eigenvector 






































Note that we have three distinct eigenvalues for 3 X 3 matrix A. So all the algebraic multiplicities 
for eigenvalues are 1, hence the geometric multiplicities must be 1 since the geometric multiplicity is 
always less than or equal to the algebraic multiplicity. 

Therefore, each eigenspace has dimension 1. We already found a nonzero vector in each eigenspace, 
and thus the vector is a basis vector for each eigenspace. 

We denote E\ for the eigenspace for eigenvector A. Then we have 


Eq — Span 



E 3 — Span 



Eq — Span 



(c) Diagonalizable matrix? Invertible matrix? Idempotent matrix? 


Since A has three distinct eigenvalues, A is diagonalizable. (Or, algebraic multiplicities are the same 
as geometric multiplicities.) 

Since the matrix A has 0 as an eigenvalue. Thus A is not invertible. 


If A is idempotent matrix, then the eigenvalues of A is either 0 or 1. Thus A is not an idempotent 
matrix. 

(For a proof of this fact, see the post Eigenvalues of an idempotent matrix.) 




Diagonalize the matrix A = 
that S~ 1 AS = D. 

(The Ohio State University) 


1 1 1 
1 1 1 
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. Namely, find a nonsingular matrix S and a diagonal matrix D such 


Hint. 

To diagonalize the matrix A , we need to find eigenvalues A and bases of eigenspaces. 


For a procedure of the diagonalization, see the post “How to Diagonalize a Matrix. Step by Step 
Explanation? 4 . 

Below, we will find eigenvalues and eigenvectors without using the characteristic polynomial although you 
may use it. 

Solution. 

We use an indirect way to find eigenvalues and eigenvectors. 

(We will not use the characteristic polynomial.) 


Applying the elementary row operations, we have 



"1 

1 

r 

R 2 —R 1 

'1 

1 

1 

A = 

1 

1 

i 

Rz—Ri 
- > 

0 

0 

0 


1 

1 

i 


0 

0 

0 

















Hence the solutions x of Ax — 0 satisfy 


Xi = —x 2 x 3 . 


Thus, every vector in the null space is of the form 



1 

1 

X 

to 

1 

CO 


'-1" 


'-1" 

X = 

1 

X X 

CO to 

= X 2 

1 

. 0 . 

+ X 3 

0 

_ 1 _ 


for some scalars x 2 , x 3 . 


It follows that 

■-n ' 

0 > 

. 1J > 

is a basis of the null space Af (A) . 



Hence 0 is an eigenvalue and the geometric multiplicity corresponding to 0, which is the nullity of A , is 2. 
It follows that the algebraic multiplicity of the eigenvalue 0 is either 2 or 3. 

We see that it is 2 shortly. 


Note that by inspection we have 


This yields that 3 is an eigenvalue of A and 


= 3 


is a corresponding eigenvector. 


The sum of algebraic multiplicities of all eigenvalues of A is 3. 

Hence the algebraic multiplicity of 0 must be 2, and that of 3 must be 1. 


In particular, the vector 


1 

1 

1 


forms a basis of the eigenspace E 3 . 


In summary so far, we have eigenvalues 0, 3 and basis vectors of eigenspaces are 


'-1' 

1 

? 

'-1" 

0 

and 

'1 

1 

. 0 . 


. 1 . 


.1. 


respectively. 
Thus, we put 



























and obtain 


where 




Prove that the matrix 


5 ~ X AS = D, 


D = 


0 0 
0 0 
0 0 


0 

0 

3 



1 

0 


is diagonalizable. 

Prove, however, that A cannot be diagonalized by a real nonsingular matrix. 

That is, there is no real nonsingular matrix S such that S' -1 AS is a diagonal matrix. 


Proof. 

We first find the eigenvalues of A by computing its characteristic polynomial p(t). 
We have 


p(t) = det (A — tl) = 


-t 

-1 


1 

-t 


= t* 


+ 1 . 


Solving p(t) — t 2 + 1 — 0 , we obtain two distinct eigenvalues =bz of A . 
Hence the matrix A is diagonalizable. 


To prove the second statement, assume, on the contrary, that A is diagonalizable by a real nonsingular 
matrix S. 

Then we have 

i 0 ' 

0 -i. 


S-'AS = 


by diagonalization. 

As the matrices A , S are real, the left-hand side is a real matrix. 
Taking the complex conjugate of both sides, we obtain 


- 1 

<1 

o 

_1 


o 

= S~ l AS =S- X AS = 

l 

o 

1_ 

— 

0 i _ 


LO -i \ 


LO -i J 


This equality is clearly impossible. 

Hence the matrix A cannot be diagonalized by a real nonsingular matrix. 
























Let A — 


— a a 
—a 1 + a 
that the matrix A is diagonalizable. 
(Nagoya University) 


be a 2 X 2 matrix, where a is a complex number. Determine the values of a such 


Problem 459 


(Nagoya University, Linear Algebra Exam Problem) 

Proof. 

To find eigenvalues of the matrix^ , we determine the characteristic polynomial p(t) of A as follows. 
We have 


p(t) = det(A - tl) = 

—a 1H 

= (1 — a — f)(l + a — t) + a 2 
= (1 - t) 2 - a 2 + a 2 = (1 - t ) 2 . 


Note 

Thus, the eigenvalue of A is 1 with algebraic multiplicity 2. 


Let us determine the geometric multiplicity (the dimension of the eigenspace E i). 
We have 


—a 

a 

R 2 —R 1 

—a 

a 

_—a 

a _ 

-> 

_ 0 

0 . 


If a ^ 0, then we further reduce it and get 


—a 

a 


1 

-l 

_ 0 

0 . 

- > 

.0 

0 . 


Hence the eigenspace corresponding to the eigenvalue 1 is 


Ei 


N(A — I) = Span 




1 

1 


5 


and its dimension is 1 , which is less than the algebraic multiplicity. 
Thus, when a 7 ^ 0, the matrix^ is not diagonalizable. 


If a — 0, then the matrix A 


1 0 
0 1 


is already diagonal, hence it is diagonalizable. 





















In conclusion, the matrix A is diagonalizable if and only if a — 0. 


m 


Consider the 2x2 complex matrix A — 


a 

0 


b — a 
b 


(a) Find the eigenvalues of A. 

(b) For each eigenvalue of A, determine the eigenvectors. 

(c) Diagonalize the matrix A. 

(d) Using the result of the diagonalization, compute and simplify A k for each positive integer k. 


Solution. 

(a) Find the eigenvalues of A . 

Since A is an upper triangular matrix, eigenvalues are diagonal entries. 

Hence a, b are eigenvalues of A . 

(b) For each eigenvalue of A , determine the eigenvectors. 

Suppose now that a ^ b. 

Let us find eigenvectors corresponding to the eigenvalue a. 

We have 

0 1 
0 0 


ro 

b — a 

R2—R1 

ro 

b — a 

T- -Kl 

b—a 

i 

o 

i 

1 

- > 

1 

0 

0 

-» 



It follows that the eigenvectors corresponding to a are 

fl 


where x i is any nonzero complex number. 


Next, we find the eigenvectors corresponding to the eigenvalue b. 

We have 

1 -1 

0 0 


A — bl — 


a — b b — a 

0 0 




Hence the eigenvectors corresponding to b are 


xi 


where x i is any nonzero complex number. 


(c) Diagonalize the matrix^ . 

When a = b, then A is already diagonal matrix. So let us consider the case a ^ b. 

In the previous parts, we obtained the eigenvalues a, b , and corresponding eigenvectors 























Let S = 


1 1 

.0 1 

Then S is invertible and we have 


be a matrix whose column vectors are the eigenvectors. 


S^AS = 


a 

0 


0 

b 


by the diagonalization process. 

Remark that this formula is also true even when a — b. 


(d) Using the result of the diagonalization, compute and simplify // k for each positive integer k. 


Using the result of the diagonalization in part (c), we have 


A = S 


For each positive integer k , we have 


i-l 


A k = 


In summary, we have the formula 


0 Consider the complex 


matrix 


a 0 
0 b 


= S 


a 0 

k 

~a k 0 ' 


S= s 


.0 b 


. 0 ZA. 


r a k 



i 

o 

1_ 


'1 -1‘ 


■3a 

o 

_i 


-1 

i —1 

o 
_1 


tk -a kl 

ZA 


A k = 


r a k 


ZA -a kl 
ZA 


A = 


\/2cosa; 

zsinz 

0 


i sinx 
0 

—i sin x 


0 

—isinx 
— \/2cos£ 


where x is a real number between 0 and 27r. Determine for which values of x the matrix A is diagonalizable. 
When A is diagonalizable, find a diagonal matrix D so that P 1 AP = D for some nonsingular matrix P . 


Solution. 


Let us first find the eigenvalues of the matrix A . 

To do so, we compute the characteristic polynomial p(t) = det(v4 — tl ) ofv4 as follows. 
Using Sarrus’s rule to compute the 3 X 3 determinant, we have 































p{t) — det(x4 — tl) 

\/2cos x — t zsinx 0 

= zsinx —t —zsinx 

0 —zsinx —\/2cos x — t_ 

— —t(^/2c osx — t)(—y/2 cosx — t) — ( — (sin 2 x)(— y/2 cosx — t) — (sin 2 x)(y / 2 cosx — t )) 
= — t 3 + 2(cos 2 x — sin 2 x)^ 

= —t 3 + 2cos(2 x)t. 

The eigenvalues of A are the roots of 

p(t) = —t 3 + 2cos(2 x)t — —t(t 2 — 2cos(2x)). 

Hence the eigenvalues are 

t — 0, 2 cos(2x). 


Note that if y^2 cos(2x) = — y^2 cos(2x) then we have cos(2x) = 0 and hence x — n/ 4,37r/4 . 
It follows that if x = nj 4, 3 nj4 , then the matrix A has only one eigenvalue 0 with algebraic 
multiplicity 3. 

Since A is not the zero matrix, the rank of A is greater than or equal to 1. 

Hence the nullity of A is less than or equal to 2 by the rank-nullity theorem. 

It follows that the geometric multiplicity (=nullity) of the eigenvalue 0 is strictly less than the algebraic 
multiplicity of 0 and A is not diagonalizable in this case. 


Now suppose that x 7t/ 4, 37r/4 . 

In this case, the matrix A has three distinct eigenvalues 0, =b^/2 cos(2x) . 

This implies thatv4 is diagonalizable. 

Let v i, V 2 , V 3 be eigenvectors corresponding to eigenvalues 0, =b y^2 cos(2x) , respectively. 
Define the 3 X 3 matrix PbyP^fvi V 2 v 3 ]. 

It follows from the general procedure of the diagonalization that P is a nonsingular matrix and 

P~ X AP =D, 


where D is a diagonal matrix 


D = 


0 0 0 

0 y 2 cos( 2 x) 0 

0 0 — ,/ 2 cos( 2 x) 


Summary 

In summary, when x — jt/4, Zn/ 4 the matrix A is not diagonalizable. 

When x 7 ^ nj 4,3 n/ 4 , the matrix A is diagonalizable and we can take the diagonal matrix D as 













D = 


0 

0 

0 


0 

y/2 cos(2x) 

0 


0 

0 

— y/2 cos(2x) 




Consider the Hermitian matrix A 


1 i 

—i 1 


(a) Find the eigenvalues of A. 

(b) For each eigenvalue of A, find the eigenvectors. 

(c) Diagonalize the Hermitian matrix A by a unitary matrix. Namely, find a diagonal matrix D and a unitary 
matrix U such that U~ l AU = D. 


Solution. 

(a) Find the eigenvalues of .T. 

To find the eigenvalues of the matrix A , we first compute the characteristic polynomial p( t) of A . 
We have 


p{t) — det(^ — tl) — 

—i 1 — t 

= (1 - f)(l - 1) - (i)(-/) = t 2 - 2t + 1 - 1 
= t(t~ 2). 


Solving p{t) = 0 , we obtain the eigenvalues 0 and 2. 

(b) For each eigenvalue of A , find the eigenvectors. 

Let us first find the eigenvectors corresponding to the eigenvalue 0. 
We solve the equation (A — 0/)x = 0 . 

We have 

R 2 +iR i I" 1 i 
A -> 

Lo oj 


Hence the eigenvalues are 


for any nonzero complex number c. 


c 


1 

i 


Next, we find the eigenvectors corresponding to the eigenvalue 2. We solve the equation (A — 21) x = 0 
We have 
















Hence the eigenvalues are 


' 1 

—i 

R2+iR i 

1 —i 

—i 

- 1 . 

- » 

-1 

o 

o 

_1 


c 



for any nonzero complex number c. 

(c) Diagonalize the Hermitian matrix A by a unitary matrix. 

To diagonalize the Hermitian matrix A by a unitary matrix U, we find an orthonormal basis for each 
eigenspace of A . 

As each eigenspace of A is 1-dimensional by part (b), we just need to normalize any eigenvector for each 
eigenvalue. 


By part (b), we know that v i 
The length of this vector is 


1 

i 


is an eigenvector corresponding to the eigenvalue 0. 


Ihill — yh • 1 + ( z ’)( — 0 — a / 2 - 


Hence the vector 

1 

i 

is a unit eigenvector. 


Ul = 


vi 

ll v i| 


1 

\/2 


Similarly, from (b) we see that 

the vector is y/2 . 

Hence 



is an eigenvector corresponding to the eigenvalue 2. The length of 


is a unit eigenvector. 




It follows that the matrix 

1 1 

i —i 


u = [ui u 2 ] = 


a/2 


is unitary and 
























U~ l AU = 


0 0 
0 2 


by diagonalization process. 

Let A be an n X n complex matrix. Let S be an invertible matrix. 

(a) If SAS -1 = A A for some complex number A, then prove that either A n = 1 or A is a singular matrix. 

(b) If n is odd and SAS -1 = — A, then prove that 0 is an eigenvalue of A. 

(c) Suppose that all the eigenvalues of A are integers and det(^4) > 0. If n is odd and SAS _1 = A~ 1 , then 
prove that 1 is an eigenvalue of A. 


Proof. 

Basic Properties of Determinants 


We use the following properties of determinants of matrices. 
For ft X n matrices A, B , we have 

(J) det(vf8) = det(v4) det(5) . 


^det(^4 l ) — det(v4) 1 if A is invertible. 


(J) det(c^4) = c n det(^4) 


for any complex number c. 


(a) If SAS 1 = aA , then prove that A n = 1 or A is a singular matrix. 

Suppose that we have SAS -1 = Li . 

We consider the determinants of both sides and we have 

detiSAS- 1 ) = det(Li). 

The left hand side becomes 

det(S/l,S x ) = det(S) det(d) det(5 x ) by property 1 

= det(S) det(d) det(.S') 1 by property 2 

= det(^). 

The right hand side is by property 3 

det (U)=A n det(.d). 

Hence we obtain 

det(^) =X n det(^4). 









It follows from this that either A n — 1 ordet(v4) = 0. 

In conclusion, either A n — 1 or A is a singular matrix. 

(b) If n is odd and SAS _1 = —A , then prove that 0 is an eigenvalue of A . 

Suppose that we have SAS _1 = — A . 

Then we have 


det(^4) = det(£) det(^4) det(S') 1 
= det(S) det(^)det(5“ 1 ) 

= det(SAS~ 1 ) 

— det(— A) 

= {-l) n det(v4) 

= -det(^) 


by property 2 
by property 1 
by assumption 
by property 3 
since n is odd. 


This yields that det(^4) = 0 . 

Note that the product of all eigenvalues ofv4 is det (^4) . 

(See the post “Determinant/Trace and Eigenvalues of a Matrix” for a proof.) 
Thus, 0 is an eigenvalue of A . 


(c) Note that as det (A ) > 0, the matrix^ is invertible. 
Suppose that SAS -1 = A -1 . 

Then we have 

det(^4) = det^) det(J) det(5') _1 
= det(.S') det(v4) det^ -1 ) 

= det^.S" 1 ) 

= det(J -1 ) 

= det(^4) _1 


by property 2 
by property 1 
by assumption 
by property 2. 


Thus we have det (A) 2 = 1 . hence det(. l) = 1 as det (.-!) > 0 by assumption. 

Note again that the product of all eigenvalues of A is det(.i). 

Since the eigenvalues of A are integers by assumption, the eigenvalues of A are either 1 or — 1 . 

If all of the n eigenvalues of A are — 1 , then the determinant is 

det(v4) = (-1)" = -1 

since n is odd, and this is a contradiction. 

Thus, at least one of the eigenvalues must be 1 . 

This completes the proof. 


Similar Transformation (conjugate) 


A transformation A to SAS 1 , for some invertible matrix S , is called a similarity transformation or 
conjugation of the matrix A . 


We say that matrices A and B are similar if there exists an invertible matrix S such that B — SAS 1 
In other words, A is similar to B if there is a similarity transformation from A to B . 

Check out the following problems about similar matrices. 


Problem. Is the matrix A — 

-i 

6' 

similar to the matrix B — 

' i 

2 


-2 

6 


-i 

4 


For a solution together with similar problems, see the post “Determine whether given matrices are similar 46 . 


Problem. Let A, B be matrices. Show that if A is diagonalizable and if B is similar to A , then B is 
diagonalizable. 


For a proof, see the post “A matrix similar to a diagonalizable matrix is also diagonalizable 44 . 

Let A be a real skew-symmetric matrix, that is, A T — —A. Then prove the following statements. 

(a) Each eigenvalue of the real skew-symmetric matrix A is either 0 or a purely imaginary number. 

(b) The rank of A is even. 


Proof. 

(a) Each eigenvalue of the real skew-symmetric matrix A is either 0 or a purely imaginary number. 


Let X be an eigenvalue of A and let x be an eigenvector corresponding to the eigenvalue X . That is, we 
have 


Ax = Xx. 


— T 

Multiplying by x from the left, we have 

x t Ax — Xx T x = A||x|| 2 . (*) 

Note that the left hand side x T Ax is the dot (inner) product of x and Ax . Since the dot product is 
commutative, we have 


The left hand side of (*) 
= x T Ax = (^x) T x 


Since A is skew-symmetric, we have A 


— x 
T _ 


t a t 

-- -A 


X. 

. Substituting this into the above equality, we have 


The left hand side of (*) 

t A T- t A - 

— x A x — —x Ax 


Taking conjugate of Ax — Xx and use the fact that A is real, we have 

Ax — Xx. 







Thus, we have 

The left hand side of (*) 

= -x T A5t 

— —x T Xx = — X | |x| | 2 . 

Therefore comparing the left and right hand sides of (*) yields 


—X IIxl1 2 = /l||x|| 2 . 


Since x is an eigenvector, it is nonzero by definition. Thus | |x| | 7 ^ 0 . 


Hence we have 


-X = X, 

and this implies that X is either 0 or purely imaginary number. 

(To see this, let X — a + ib , where a, b £ R. Then 

—X — —a + ib — a + ib — X 

implies a — 0, thus X — bi .) 

Remark: Another similar proof is to take conjugate of (*). 

(b) The rank of A is even 

From part (a), we know that the eigenvalues of A are 0 or purely imaginary. 

Thus if X is a purely imaginary eigenvalue of A , then its conjugate X — —X is also an eigenvalue of A 
since A is a real matrix. 

Thus, nonzero eigenvalues come in pairs A, —X (and their algebraic multiplicities are the same). 

Let 


X r 5 X r j X 2 j X 2 5 • • • 5 X ^ X 


be nonzero eigenvalues of A . 


Since a real skew-symmetric matrix is normal, it is diagonalizable (by a unitary matrix). 

Thus there exists an invertible matrix P such that 

P l AP = 

diag [2 1 -X x A 2 -A 2 ••• X k -X k 0 ...0], 

where diag [a i , a 2 ,..., a n \ denotes the n X n matrix whose diagonal entries are a \, a 2 ,..., a n and 
all the off diagonal entries are zero. 


Since P is an invertible matrix, the rank of A is the same as the rank of the diagonal matrix on the right- 
hand side, which is easily seen to be 2k . 

Thus the rank of A is 2k, and we have proved that the rank of the real skew-symmetric matrix A is even. 

Related Question. 

As an application of this problem, try the following problem. 


Problem 1 . If A is a skew-symmetric matrix, then I + A and I — A are nonsingular matrices, 
where / is the identity matrix. 


For a proof, see the posLr 

If A is a Skew-Symmetric Matrix, then I + A is Nonsingular and (/ — A) (/ + v4) _1 is Orthogonal. 
Problem 2. Prove that the determinant of an n X n skew-symmetric matrix is 0 if n is odd. 


For a proof, check out the post i 

The Determinant of a Skew-Symmetric Matrix is Zero. 

More Eigenvalue and Eigenvector Problems 

Problems about eigenvalues and eigenvectors are collected on the page: 

Eigenvectors and Eigenspaces 


3 Let A be an n X n real symmetric matrix. Prove that there exists an eigenvalue A of A such that for any vector 
v E M n , we have the inequality v • Av < A|| v|| 2 . 

Proof. 

Recall that all the eigenvalues of a symmetric matrices are real numbers. 

Let kp ..., A n be eigenvalues of A . 

Since these eigenvalues are real numbers, there is the largest one. 

Let A be the largest eigenvalue of A. 

With this choice of A we show that the inequality 

v • Ax < AM 2 


holds for any v6R”. 


Also recall that for a real symmetric matrix, there are eigenvalues v 1? ..., \ n corresponding to 
Ap ...,A n such that 


form an orthonormal basis of R n . 

(This statement is equivalent to that every real symmetric matrix is diagonalizable by an orthogonal 
matrix.) 

Let v be an arbitrary vector in R n . 

Then since B is a basis of R n , we can write 

y = c x y l + - + c n y n 

for some c 1? ..., c n E R. 

Then we calculate 


Av = A(c 1 v 1 + -+c n v n ) 


= c 1 Av l + - + c n Av i 


n 



since A\ f = 2-v- for i = 


Using this, we have 


y-Ay = (cjVj + - + c„v B ) • (c x k jVj + - + c„A„v„) 

= c\x x + -+ c lx n - 


Here, we used that B = {vj,v (; j is an orthonormal basis of R 3 . 
That is, we used the properties 



Since X is the largest eigenvalue of A, we have 


y ■ Ay = c\k x + - + c\l n 


< c j A + ••• + 

= l(cj + ■■■ + c l) 
= Allvll 2 . 


Hence the required inequality holds. 


(J) A real symmetric n X n matrix A is called positive definite if x T ^lx > 0 for all nonzero vectors x in 

(a) Prove that the eigenvalues of a real symmetric positive-definite matrix A are all positive. 

(b) Prove that if eigenvalues of a real symmetric matrix A are all positive, then A is positive-definite. 




Proof. 

(a) Prove that the eigenvalues of a real symmetric positive-definite matrix^ are all positive. 

Recall that the eigenvalues of a real symmetric matrix are real. 

(See the corollary in the post “Eigenvalues of a Hermitian matrix are real numbers 46 .) 

Let A be a (real) eigenvalue of A and let x be a corresponding real eigenvector. That is, we have 

Ax = Ax. 


T 

Then we multiply by x on left and obtain 

x T Ax = Ax T x 
= A||x|| 2 . 

The left hand side is positive as A is positive definite and x is a nonzero vector as it is an eigenvector. 
112 

Since the length | |x| | is positive, we must have A is positive. 

It follows that every eigenvalue A of A is real. 

(b) Prove that if eigenvalues of a real symmetric matrix^ are all positive, then A is positive- 
definite. 


Note that a real symmetric matrix is diagonalizable by an orthogonal matrix. 

So there exists an on orthogonal matrix Q such that Q T AQ — D , where D is the diagonal matrix 


D = 


Ai 0 

0 A 2 

0 0 


0 

0 


0 

0 

A n - 


whose diagonal entries A / are eigenvalues of A , which are positive by assumption. 


Let x be an arbitrary nonzero vector in J§L n . 

Since A — QDQ T (remark that Q 1 — Q T ), we have 

x T Ax = x T QDQ T x. 

rr\ 

Putting y = Q x , we can rewrite the above equation as 

x t Ax = y T Dy. 


Let 


y l 

^2 

y = . 

.y n . 

Then we have 







x t Ax = y T Dy 


= [yi y 2 


yn 


'Ai o 

o a 2 


o o 

— ^ \ y \ + ^ 2^1 + ' ' ' + ^ h^h• 


0 0 
0 0 



~y i" 


yi 


.y n _ 


By assumption eigenvalues X / are positive. 

Also, since x is a nonzero vector and 2 is invertible, y = <Q T x is not a zero vector. 

Thus the sum expression above is positive, hence x T Ax is positive for any nonzero vector x. 
Therefore, the matrix A is positive-definite. 

Related Questions. 


Problem. Let A be an n X n real matrix. Prove the followings. 

(a) The matrix AA T is a symmetric matrix. 

(b) The set of eigenvalues of A and the set of eigenvalues of A T are equal. 

(c) The matrix A A T is non-negative definite. 

(An n X n matrix B is called non-negative definite if for any n dimensional vector x, we have 
x T 5x > 0 .) 

(d) All the eigenvalues of AA T is non-negative. 


For a solution, see the post “Transpose of a matrix and eigenvalues and related questions. “. 

Problem. Suppose A is a positive definite symmetric n X n matrix. 

(a) Prove that A is invertible. 

(b) Prove thatv4 _1 is symmetric. 

(c) Prove that A _1 is positive-definite. 


For proofs, see the post “Inverse matrix of positive-definite symmetric matrix is positive-definite 44 . 


Problem. 

Prove that a positive definite matrix has a unique positive definite square root. 


For a solution of this problem, see the post 

A Positive Definite Matrix Has a Unique Positive Definite Square Root 






<& Suppose A is a positive definite symmetric n X n matrix. 

(a) Prove that A is invertible. 

(b) Prove that A -1 is symmetric. 

(c) Prove that A -1 is positive-definite. 

(jMIT) 


Proof. 

(a) Prove that A is invertible. 

Recall that a symmetric matrix is positive-definite if and only if its eigenvalues are all positive. 

Thus, since A is positive-definite, the matrix does not have 0 as an eigenvalue. 

Hence A is invertible. 

(b) Prove that A _1 is symmetric. 

By part (a), we know that A is invertible. We have 

A ~ 1 A = I, 

where I is the n X n identity matrix. 

Taking the transpose, we have 

/ = / T = (A _1 A) T 
= A t (A -1 ) t 

= A (A _1 ) T since A is symmetric. 


It follows that A 1 = (A 1 ) T , and hence A 1 is a symmetric matrix. 

(c) Prove that A _1 is positive-definite. 

Again we use the fact that a symmetric matrix is positive-definite if and only if its eigenvalues are all 
positive. 

(See the post “Positive definite real symmetric matrix and its eigenvalues” for a proof.) 

All eigenvalues of A ~ 1 are of the form 1/A , where A is an eigenvalue of A . 

Since A is positive-definite, each eigenvalue A is positive, hence 1/A is positive. 

So all eigenvalues of A -1 are positive, and it yields that A _1 is positive-definite. 


Let A = 


1 

4 


2 

3 


(a) Find eigenvalues of the matrix A. 

(b) Find eigenvectors for each eigenvalue of A. 

(c) Diagonalize the matrix A. That is, find an invertible matrix S and a diagonal matrix D such that 

S~ 1 AS = D. 


(d) Diagonalize the matrix A? — 5 A 2 + 3 A + I, where / is the 2 X 2 identity matrix. 

(e) Calculate A 100 . (You do not have to compute 5 100 .) 

(f) Calculate (A 3 — 5A 2 + 3A + J) 100 . Let w — 2 100 . Express the solution in terms of w. 




Solution. 

(a) Find eigenvalues of the matrix^ . 

To find the eigenvalues of A , we calculate the characteristic polynomial p{t) as follows. 
We have 


p(t ) = det(v4 — tl ) = 

= (1 - 0(3 - t) - 8 = t 2 | 4t - 5 = [t + l)(t - 5). 

The eigenvalues ofv4 are roots of its characteristic polynomial p(t). 

Hence the eigenvalues of A are — 1 and 5. 

(b) Find eigenvectors for each eigenvalue of A . 

We first determine the eigenvectors of the eigenvalue —1 by solving the system (A + /)x = 0 . 
We have 


1-t 2 

4 3 — t 


1 

to 

to 
_1 

R 2 ~2Ri 

'i 

r 

1 

1 

then \Ri 

O 

_ i 

0. 


This yields that the eigenvectors corresponding to — 1 are 


1 

-1 


for any nonzero scalar a. 

Next, we find the eigenvectors corresponding to the eigenvalue 5 by solving (A — 51) x = 0 . 
We have 


A — 51 — 


'-4 

2 ' 

R 2 ~\~R i 

'i 

-1/2' 

. 4 

-2. 

? 

then A-R i 

.0 

0 


It follows that the eigenvectors corresponding to 5 are 


for any nonzero scalar a. 

(c) Diagonalize the matrix^ . 

From part (a) and part (b), we have seen thatv4 has eigenvalues —1 and 5 with corresponding 
eigenvectors 



l ' 


r 

U == 

-l 

and v = 

2 


(Here we chose the scalars a to be 1 but you could use any nonzero values for the scalars a.) 























Let 


S = [u v] 


1 1 

-1 2 


Then the general procedure of the diagonalization yields that the matrix S is invertible and 


S~ l AS = D , 


where D is the diagonal matrix given by 


0 
5 

(d) Diagonalize the matrix^ 3 — 5A 2 + 3A + / . 

In part (c), we obtained 



S ~ l AS = D, 


where 



' 1 1 

and D — 

-i 

o' 

s = 




-1 2 


. 0 

5 


Note that we have A — SDS 1 and 

A 2 =AA = SDS _1 • SDS _1 = SD 2 S _1 
A 3 =A 2 A = SD 2 S~ 1 SDS- 1 — SD 3 S~ 1 . 


These relations gives 


A 3 - 5A 2 + 3A + I = SD 3 S ~ l - 5 SD 2 S _1 + 3 SDS ~ l + / 
= S(D 3 - 5D 2 + SD + I)S-\ 


S-\A 3 -5A 2 + 3A+I)S 
= D 3 - hD 2 + 3D +1 


'(-l) 3 O' 
. 0 5 3 . 

- 5 

'(-i) 2 o' 

. 0 5 2 . 

+ 3 

'-1 O' 

. 0 5. 

+ 

'1 O' 
.0 1. 


-8 0 

0 16 


Hence we obtain 




















The Cayley-Hamilton Theorem 


Summary 


(The Cayley-Hamilton Theorem) Wp(t) is the characteristic polynomial for an n X n matrix A, 
then the matrix p(A) is the n X n zero matrix. 


Example 
Let A 


1 1 
1 3 


. The characteristic polynomial p(t ) of A is 


p(t ) = det(^4 — tl) = 


= t - At + 2. 


1 - t 
1 


1 

3 - f 


Then the Cayley-Hamilton theorem says that the matrix p(A) = A 2 — 4A + 21 is the 2x2 zero matrix. In 
fact, we can directly check this: 


p(A) = A 2 - 4A + 21 = 


'i r 

'i r 

- 4 

'i r 

+ 2 

'1 O' 

.1 3. 

.1 3. 


.1 3. 

.0 1. 


2 4 

4 10 


+ 


-4 -4 

-4 -12 


+ 


2 0 
0 2 


0 0 
0 0 


Problems 

o 


1 0 2 

Let T — 0 1 1 

.0 0 2 

identity matrix. 


. Calculate and simplify the expression — T 3 + 4 T 2 + 5 T — 21, where / is the 3 X 3 


Hint. 

Use the Cayley-Hamilton theorem. 

If p{t) is the characteristic polynomial for an n X n matrix A , then the matrix p(A ) is the n X n 
zero matrix. 


Solution. 

We use the Cayley-Hamilton theorem. 

































The Cayley-Hamilton Theorem. 

If p(t) is the characteristic polynomial for an n X n matrix A , then the matrix p(A ) is the n X n 
zero matrix. 


To obtain the characteristic polynomial for T , we note that the matrix T is upper triangular. Thus T — tl 
is also upper triangular and recall that the determinant of an upper triangular matrix is the product of the 
diagonal entries. Thus the characteristic polynomial p j {t) for T is 

p T (t) = det (T - tl) = (1 - f)(l - t){2 -t) = -1 3 + 4 1 2 -5t + 2. 


By the Cayley-Hamilton theorem, we have 

p T (T) = -T 3 + 4 T 2 - ST + 21 = 0. 

(Don’t forget /.) 

Here O is the 3x3 zero matrix. 


Now we compute 

-T 3 + 4 T 2 + 5T -21 = ( -T 3 + 4 T 2 -5 T + 21) + (10r - 4 1) 

= pt{t) + ior - 4/ = lor - 4/ 


10 

0 

20' 


'4 

0 

o' 

0 

10 

10 

- 

0 

4 

0 

0 

0 

20. 


.0 

0 

4. 


6 0 20 
0 6 10 
0 0 16 


Therefore the answer is 


-T 3 +47 2 + 57 -21 = 


6 0 
0 6 
0 0 


20 

10 

16 


Comment. 

This problem is actually one of the final exam problems in the linear algebra class I taught. 

Of course, you may calculate them directly but most students who computed directly made calculation 
mistakes (unfortunately). 

The second common mistake is that students jumped to conclusion that the expression 
— r 3 +4 T 2 +5 T — 21 is the zero matrix after finding the characteristic polynomial. 

Those students knew that they could use the Cayley-Hamilton theorem but were a bit careless. The given 
expression and the characteristic polynomial are slightly different. 










Anyway, I didn’t subtract many points for the second mistake. 

(And you get little points if you made a mistake for a direct calculation.) 


O 

Find the inverse matrix of the matrix A = 


1 

9 

5 


1 

2 

0 


2 

0 

3 


using the Cayley-Hamilton theorem. 


Solution. 

To use the Cayley-Hamilton theorem, we first compute the characteristic polynomial p(t) of the matrix A . 
We have 


p(t) = det (A — tl) 

1 -1 1 2 

9 2-t 0 

5 0 3 -1 


= (—1) 3+1 5 


2 

0 


+ (- 1) 3+2 • 0 


1 - t 
9 


2 

0 


+ (—1) 3+3 (3 — t) 


1 

2 - t 


(by the 3rd row cofactor expansion) 

= 5(2* - 4) + 0 + (3 - t) ((1 - i){2 -*)-9) 
= - 1 3 + 6 1 2 + 8t- 41. 


Then the Cayley-Hamilton theorem yields that p(A) — O , the zero matrix. That is, we have 

O - p{A) = -A 3 + 6A 2 + 8A - 417. 


Thus, we have 


417 = -A 3 + 6A 2 + 8A = A(-A 2 + 6A + 87), 


or equivalently 


I = A[ 41 (-d 2 +6H + 87) ). 


It follows that the inverse matrix is given by 


A- 1 = A(_h 2 + 6H + 87). 


By a direct computation, we have 


a 2 = 


20 3 8 

27 13 18 
20 5 19 


and 















■A 2 + 6A + 81 = - 

'20 3 8 ' 

27 13 18 

+ 6 

'1 1 2' 

9 2 0 

+ 8 

'1 0 

0 1 

o' 

0 


20 5 19 


5 0 3 


0 0 

1 


-6 3 4 

27 7 -18 

10 -5 7 


Therefore the inverse matrix is 




1 

41 


-6 3 4 

27 7 -18 

10 -5 7 


More Exercise 


Test whether you understand how to find the inverse matrix using the Cayley-Hamilton theorem by the 
next problem. 

Problem. Find the inverse matrix of the 3 X 3 matrix 


A = 


7 2 -2 

-6 -1 2 
6 2-1 


using the Cayley-Hamilton theorem. 


Find the inverse matrix of the 3x3 matrix A = 


Solution. 


7 2 -2 

-6 -1 2 
6 2-1 


using the Cayley-Hamilton theorem. 


To apply the Cayley-Hamilton theorem, we first determine the characteristic polynomial p(t) of the 
matrix A. 

Let / be the 3 x 3 identity matrix. 

We have 


pit) = 

det (A 

-tl) 




1-t 

2 -2 



= 

-6 

-1-t 2 




6 

2 -1-t 





-1-t 2 


-6 2 


-6 -1-t 

= (7 -t) 

2 -1-t 

-2 

6 -1-t 

+ ( —2) 

6 2 


(by the first row cofactor expansion) 


= -t 3 + 5t 2 ~lt+3. 


(You may also use the rule of Sarrus to compute the 3 x 3 determinant.) 
























Thus, we have obtained the characteristic polynomial 

pit) = ~t 3 + 5t 2 ~lt + 3 

of the matrix A. 

The Cayley-Hamilton theorem yields that 

O = p(A) = - A 3 + 5A 2 - 7A + 31, 

where O is the 3x3 zero matrix. 

(Here, don’t forget to put the identity matrix I.) 

Rearranging terms, we have 

A 3 - 5A 2 + 1A = 3I 
A(A 2 -5A + 1I) = 3I 

o a(^(A 2 - 5A + 7 /) J = I. 


Similarly, we have 


~(A Z -5A + 7I)\A=L 


It follows from these two equalities that the matrix 

1 „ 


is the inverse matrix of A. 
Therefore, we have 


A ~ 1 = ~(A Z - 5A + 1I) 


1 


~(A Z -5A + 71) 


/ 

■25 8 - 8 1 


'7 2 - 2 ' 


-24 -7 8 

-5 

-6 -1 2 

1 

24 8 -7 


6 2-1 


+ 7 


r -3 -2 2 

6 5-2 

-6 -2 5 


In summary, the inverse matrix of A is 


A~ l = - 


-3 -2 
6 5 

-6 -2 


2 

-2 

5 













Let 


A = 


1 

2 



Find the eigenvalues and the eigenvectors of the matrix 

B = A 4 - 3A 3 + 3 A 2 -2A-h 8E. 


(Nagoya University) 



Apply the Cayley-Hamilton theorem. 

That is if p A (t) is the characteristic polynomial of the matrix A, then the matrix p A (A) is the zero 
matrix. 


Solution. 

Let us first find the characteristic polynomial p A (t) of the matrix H. 
We have 


p A (t)= det (A-tI) = 


1 -t -1 
2 3 -t 


= (1 - 0(3 - 0 - ( - 1)(2) = t 2 ~ 4t + 5. 


Solving t 2 - At + 5 = 0, we see that the matrix A has the eigenvalues 2 ± i but it is not a good idea to 
use this directly to find the eigenvalues of the matrix B. 

Instead, note that by the Cayley-Hamilton theorem, we know that 

p t (A) =A 2 -4A + 5I= O, 


where / is the 2x2 identity matrix and O is the 2x2 zero matrix. 


Since we have 

B = A 4 - 3A 3 + 3 A 2 -2A + 8E= (A 2 - 4A + 5I)(A 2 +A+2I)+A~2I, 

we have 


B= A-21 = 




Since the eigenvalues of A is 2 ± z, the eigenvalues of B = A - 21 are 

(2 ± 0 - 2 = ± z. 


Next, we find eigenvectors. 

Let us first find eigenvectors corresponding to the eigenvalue z. 
We have 








Thus we have 


r 2 r x 

"2 1 -/■ 

2 R 1 

■1 (1-0/2- 


0 0 


0 0 


1 -i 

x i = ~~r 

and the eigenvectors associated with the eigenvalue i are 


x = 


1 — i 

i 


where x 2 is any nonzero complex number. 

Or equivalently, scaling the vector by -1 + /, the eigenvectors corresponding to the eigenvalue i are 

r i l 


where a is any nonzero complex number. 


Since B is a real matrix and the eigenvalues / and -i are complex conjugate to each other, the 
eigenvectors of- / are just the conjugates of eigenvectors of /. Thus the eigenvectors corresponding to 
the eigenvalue - / are 


b 


1 

—1 + / 


where b is any nonzero complex number. 


Let A, B be complex 2x2 matrices satisfying the relation A — AB — BA. Prove that A 2 = O, where O is 
the 2x2 zero matrix. 



Q Find the trace of A . 


Use the Cayley-Hamilton theorem 


Proof. 



















We first calculate the trace of the matrix A as follows. We have 


tr (A) = tr (AB - BA) 

= tr (AB) — tr (BA) 

= tr (AB) - tr (AB) = 0. 

Thus tr(X) = 0 and it follows from the Cayley-Hamilton theorem (see below) for the 2x2 matrix A 
that 


O = A 2 — tr (A)A + det(A)I 
= A 2 + det (A)I, 


where / is the 2 X 2 identity matrix. 

Thus, we obtain 

A 2 = -det(A)I. (*) 


Next, we compute A 2 in two ways. 
We have 


A 2 = A(AB -BA)=A 2 B- ABA 


and 


A 2 = (AB- BA) A = ABA -BA 2 . 


Adding these two, we have 


2A 2 = A 2 B -BA 2 

= (- det (A)I)B - B(- det(yt)/) 
= — det (A)B + det (A)B = O. 

As a result, we obtain A 2 — O . This completes the proof. 


The Cayley-Hamilton theorem for a 2 X 2 matrix 


Let us add the proof of the fact we used in the proof about the Cayley-Hamilton theorem. 

a b 


Let A = 


c d 


be a 2 X 2 matrix. 


Then its characteristic polynomial is 

p(x) — det(X — xl) 
a — x b 
c d — x 

— (a — x)(d — x) — be 

— x 2 — (a + d)x + ad — be 

— x 2 — tr (A)x + det (A), 






since tr (A) — a + d and det (A) — ad — be. 

The Cayley-Hamilton theorem says that the matrix A satisfies its characteristic equation p{x ) = 0 . 
Namely we have 

A 2 — tr (A)A + det (A)I = O. 


This is the equality we used in the proof. 

Variation 

As a variation of this problem, consider the following problem. 

Let A , B be 2 X 2 matrices satisfying A — AB — BA . 
Then prove that det (A) — 0 . 


Suppose that A is 2 X 2 matrix that has eigenvalues —1 and 3. Then for each positive integer n find a n and b n 
such that A n+1 = a n A + b n I, where / is the 2 X 2 identity matrix. 

Solution. 

Since -1, 3 are eigenvalues of the matrix A, the characteristic polynomial of A is 

0+ l)(t- 3) = t 2 - 2/- 3. 

By Cayley-Hamilton theorem, we have 

A 2 -2A-3I = O, 


where O is the 2 x 2 zero matrix. 
Thus we have 


A 2 — 2A + 37, 


and hence a x = 2, Z? 1 = 3. 
Multiplying (*) by A, we have 


A 3 = A A 2 = 2 A 2 + 3 A 
= 2(2 A + 31) + 3A 
= 7A + 61. 


Thus, a 2 = 7,b 2 = 6. 
In general, we have 


A n+2 =AA n+1 

= A(a„A + b n I) 

=v 2+ v 

= a n (2A + 37) + b n A 
= (2a n + b n )A + (3a n )L 


Thus we obtain 

a n+l =2a n + b n 
b n+l =3a ,r 

This yields the linear recurrence relation 

a n+2 = 2a n + \ + 3 a n 

with initial values a x = 2, a 2 = 7. 

The general term a n of the sequence (a n )f =n is obtained in the previous post Solve a linear recurrence 
relation using vector space technique: 

(You may alternatively find this using an elementary technique.) 

Then we also have 

** = 3^-! - |(c-i r~'+r). 

(This formula is true for n = 1 as well.) 

In conclusion, we have obtained 

A n = ^(( - !)” + 3 ra+1 )^f + |(( - i) w_1 + 3")7 


for any positive integer n. 


| Let A be an n X n complex matrix. Let p(x) = det {xl — A) be the characteristic polynomial of A and write 
it as 

p(x) = x n + a n -ix n ~ l H-h aix + a 0 , 

where are real numbers. Let C be the companion matrix of the polynomial p(x) given by 

0 0 ... 0 — a 0 

1 0 ... 0 — a\ 

c= 0 1 • 0 - ° 2 = [e 2 ,e 3 ,...,e n ,-a], 

_0 0 ... 1 -a n - 1 

where is the unit vector in C n whose i -th entry is 1 and zero elsewhere, and the vector a is defined by 
a 0 


a — 


ai 


. Then prove that the following two statements are equivalent. 


_ ^n —1 . 

(1) There exists a vector v E C n such that 







v, Av, A 2 v, ..., A n 1 v 


form a basis of C n . 

(2) There exists an invertible matrix S such that S' -1 AS = C. 

(Namely, A is similar to the companion matrix of its characteristic polynomial.) 

Proof. 

Before proving the equivalence of the statements, let us first note a simple but useful fact. 

Let B be n x n matrix and let e z - be the unit vector of C n as defined in the problem. Then the matrix 
product Be f is the i-th column vector of the matrix B. 


(1 =* 2 ) 

Suppose that the statement 1 is true. That is, there exists a vector v E C n such that 

v, Av, ...,A n ~ l \ 

form a basis of C n . In particular, these vectors are linearly independent. 

Form a matrix 


S := [v, Ax ,..., A n ~ ! v], 

then the matrix S is invertible since the columns vectors are linearly independent. 

We prove that AS = SC. 

We have 

AS = A[v,A\, ...,A n ~ l \], 

= [Av,A 2 \, ...,A n v], 

On the other hand, we have 

SC = S[e 2 , e 3 ,..., e n , - a] 

= [Se 2 ,Se 3 ,...,Se n , -5a]. 

As noted at the beginning, the vector Se 2 is the second column vector of S, which is Av. Similarly, Se t 
is the i-th column vector of S , and thus we have 

5e,. - 'v. 

Hence we have 

SC = [Av,A 2 v, ...,A n ~ l y, -5a], 

Thus, it remains to prove that -5a = A n \. 

Write 


n— 1 


= a 0 e l +a 1 e 2 + -+a n _ i e n . 




Then we have 

Sa = S(a^e^ + a^ 2 + *“ + a n-\^r) 

= a 0 5e 1 +a 1 5e 2 + - + a„_ 1 Se n 
( * ) 

= a^y + ciyAy +—v a n _ x A n ~ x y 
|= («(/+ a X A + *•• + a n _ l A n *)v 
= -A n y, 


where the last step follows from the Cayley-Hamilton theorem 

O =p(A) = A n + a n _ l A n ~ l + ---a^A + a^I. 

This proves that ~Sa = A n y and hence AS = SC. 

Since S is invertible, we obtain 

S~ 1 AS=C. 


(2 => 1 ) 

Next suppose that the statement 2 is true. That is, there exists an invertible matrix S such that 
S~ l AS = C. Then we have AS = CS. 

Define v := Se t to be the first column vector of S. 

We prove that with this choice of v, the vectors 

v,Av,A 2 v, ...,A n ~\ 

form a basis of C n . 

We have 


AS A [Se i, Se 2 , • • •, Se J 
= [ASe^,ASe 2 , . ..,v4SeJ 


and 


SC *^[ e 2’ e 3’ ''•’ a ] 

= [Se 2 , Se 2 ,..Se n , — Sa]. 


Since AS = SC, comparing column vectors, we obtain 

^4*Se 1 = Se 2 
ASe 2 Se^ 

ASe n -, =Se n 
ASe n = - 5a. 


It follows that we have inductively 


Ax = Se 2 

A 2 x = A(Ax) = ASe 2 = Se 3 
A n ~ l \ =A(A n ~ 2 v) =ASe n _ 1 = Se >r 


Namely we obtain 

A'v = Se i+l 


for i = 0 , 1 , — 1 . 

Now, suppose that we have the linear combinaiton 

CjV + c 2 A\ + ...c n A n ~ 1 v = 0, 
for some coefficient scalars c 1? c n 6 C. 

Then we have using the above relations 

+ c 2 Se 2 + ••• + c n Se n = 0. 

Since S is invertible, the column vectors Se t are linearly independent. 
Thus, we must have c 1 = c 2 = ••• = c n = 0. 


Therefore the vectors 


v,Av,A 2 v, ...,A n ~ 1 v 

are linearly independent, and they form a basis of C n since we have n linearly independent vectors in 
the ^-dimensional vector space C n . 


Let n > 1 be a positive integer. Let V = M nxn (C) be the vector space over the complex numbers C consisting 
of all complex n x n matrices. The dimension of V is n 2 . Let A £ V and consider the set 

S A = {I = A°,A,A 2 ,...,A n2 - 1 } 

of n 2 elements. Prove that the set S a cannot be a basis of the vector space V for any A E V . 

Proof. 

We prove that the set Sa is linearly dependent, hence it cannot be a basis of V. 

Since A is an n X n matrix, its characteristic polynomial p(t ) = det (tl — A) is a degree n 
polynomial. 

(Your preferred definition of the characteristic polynomial might be det (A — tl ). It is straight 
forward to modify the following proof with this definition.) 


Let us write it as 


p(t) — t n + dn-it n ’*’ + ••• + CL\X + ao. 


Then the Cayley-Hamilton theorem states that 


p(A) — A n + a n -iA n + • • • + ciiA + a^I — O 


is the zero matrix. 

Since the coefficient of A n is 1, this gives a non-trivial linear combination of /, A ,. . ., A n . 
Therefore the set 


T:={I,A,...,A n } 


is linearly dependent. 

As T is a subset of Sa> the set Sa is also linearly dependent. 
Therefore, Sa is not a basis of V . This completes the proof. 


Let A be a 3 X 3 real orthogonal matrix with det(vl) = 1. 

(a) If Ar?i is one of the eigenvalues of A, then find the all the eigenvalues of A. 

(b) Let A 100 = a A 2 + b A + cl , where / is the 3 X 3 identity matrix. 

Using the Cayley-Hamilton theorem, determine a, 6, c. 

(Kyushu University) 


Solution. 

(a) Find the all the eigenvalues of A. 

Since A is a real matrix and [ s a complex eigenvalue, its conjugate 1 is also an 

eigenvalue of A. 

As A is a 3 X 3 matrix, it has one more eigenvalue A. 


Note that the product of all eigenvalues of A is the determinant of A. 
Thus, we have 


-l + y/3 i 

2 


— 1 — y/3i A 1 / A\ 

---A = det(A) 


1 . 


Solving this, we obtain A = 1. 
Therefore, the eigenvalues of A are 


— 1 + y/3i —1 — \/3 i 

2 5 2 

(a) Using the Cayley-Hamilton theorem, determine a, &, c. 


To use the Cayley-Hamilton theorem, we first need to determine the characteristic polynomial 
p(t ) = det(A — tl ) of A. 

Since we found all the eigenvalues of A in part (a) and the roots of characteristic polynomials are 
eigenvalues, we know that 










— — (i 2 + t + l)(i — 1) 

= -1 3 + 1. 

(Remark that if your definition of the characteristic polynomial is det (tl — A), then the first 
negative sign in (*) should be omitted.) 

Then the Cayley-Hamilton theorem yields that 

P(A) = -A 3 + I = O, 

where O is the 3x3 zero matrix. 

Hence we have A? = I. 

We compute 

A 100 = (A 3 ) 33 A = I 33 A — IA — A. 

Thus, we conclude that a = 0,b=l,c = 0. 

Comment. 

Observe that we did not use the assumption that A is orthogonal. 




Let A and B be 2 X 2 matrices such that ( AB ) 2 = O, where O is the 2 X 2 zero matrix. Determine whether 
(. BA ) 2 must be O as well. If so, prove it. If not, give a counter example. 

Proof. 

It is true that the matrix ( BA ) 2 must be the zero matrix as we shall prove now. 

For notational convenience, let C := AB. 

As C is a 2 X 2 matrix, it satisfies the relation 

C 2 - tr(C)C + det(C) J = O (*) 

by the Cayley-Hamilton theorem. 

Here I is the 2x2 identity matrix. 


We compute the determinant of C as follows. 
We have 


det(C) 2 = det(C 2 ) = det((AB) 2 ) = det(O) = 0. 


Hence det(C') = 0. 




Since C 2 = O and det(C) = 0, the Cayley-Hamilton relation (*) becomes 


tr (C)C = O . 

It follows that we have either C — O or tr(C) = 0. 

In either case, we have tr(C) = 0. 


Note that 

det(5i4) = det(AB) = det (C) = 0 
tr(BA) = tr(AB) — tr(C) = 0. 

Applying the Cayley-Hamilton theorem to the matrix BA, we obtain 

O = (BAf - tr (BA) • BA + det (BA)I = (BA) 2 . 


Thus, we obtain (BA) 2 — O as claimed. 


Dot Products and Length of Vectors 


Definition 


Let A be a square matrix. 

Two vectors Vi and V 2 are orthogonal (perpendicular) if Vi • V 2 = 0. 

Let v be n-dimensional complex vector. Then the length of v is defined to be 

ll v ll = \/v T v. 

The distance between two vectors Vi, V 2 is the length || Vi — V 2 1|. 

A is orthogonal if A T A — I — AA T . 

A is normal if A* A — A — A A* . Here A * = A . 


Summary 


Let A be a square matrix. Let v £ K n , where K = R or C. 

O ll v ll — 0- 

A is orthogonal if and only if its column vectors form an orthonormal set. 


Problems 

Let Ui, U2, U3 are vectors in R n . Suppose that vectors Ui, U2 are orthogonal and the norm of 112 is 4 and 
uj U3 = 7 . Find the value of the real number a in Ui = U2 + cm3. 

(The Ohio State University) 

Hint. 


Recall the following definitions. 

Q The inner product (dot product) of two vectors v 1? v 2 is defined to be 

V 1 ' v 2 :=v ^ v 2* 


Two vectors v 1? v 2 are orthogonal if the inner product 

















Q The norm (length, magnitude) of a vector v is defined to be 

I M | = V v ‘ v 


Solution. 

We first express the given conditions in term of inner products (dot products). 
Since u 1 and u 2 are orthogonal, the inner product 

u 2 • Uj = • u 2 = 0. 

Also, since the norm of u 2 is 4, we obtain 

U 2 • u 2 = I I U 2 1 I 2 = 16. 


The last condition can be written as 


u 2 u 3 = U 2 U 3 = 7. 

Now we compute the inner product u 2 and 

O) 

0 = u 2 • = u 2 • (u 2 + au 3 ) 

= u 2 • u 2 + au 2 • u 3 

= 16 + la. 


Therefore, solving this we obtain 


a 


16 

~ 7 ‘ 


^ Let a and b be vectors in R n such that their length are ||a|| = ||b|| = 1 and the inner product 


a • b = a x b =-. 


Then determine the length 11 a — b} |. 

Solution. 


Recall that the length of a vector x is defined to be 

llxll = ^/x T x, 


where x T is the transpose of x. 

Also, recall that the inner product of two vectors x, y are commutative. 
Namely we have 

x y = x T y = y T x = y x. 




Applying the second fact with given vectors a, b, we obtain 


a T b = b T a = - 


1 

2 ’ 


Now we compute la - bll 2 as follows. 
We have 


la - bll 2 = (a - b) T (a - b) (by definition of the length) 
= (a T - b T )(a - b) 

= a T a - a T b - b T a + b T b 

*= Hall 2 - a T b - b T a + llbll 2 



= 3. 


Since the length is nonnegative, we take the square root of the above equality and obtain 

la - bll = a/3. 


Let A and B be n X n skew-symmetric matrices. Namely A T — —A and B T — —B. 

(a) Prove that A + B is skew-symmetric. 

(b) Prove that cA is skew-symmetric for any scalar c. 

(c) Let P be an m X n matrix. Prove that P T AP is skew-symmetric. 

(d) Suppose that A is real skew-symmetric. Prove that i A is an Hermitian matrix. 

(e) Prove that if AB = —BA, then AB is a skew-symmetric matrix. 

(f) Let v be an n-dimensional column vecotor. Prove that v T dv = 0. 

(g) Suppose that A is a real skew-symmetric matrix and A 2 v = 0 for some vector v E M. n . Then prove that 
Av — 0. 

Proof. 

(a) Prove that .1 + B is skew-symmetric. 


We have 


(A+B) J = A t + B t = + -(. A+B ). 


Hence A+B is skew-symmetric. 


(b) Prove that cA is skew-symmetric for any scalar c. 


We compute 


(cA) t = cA t = c( — A) = - cA. 


Thus, cA is skew-symmetric. 


(c) Let P be an m x n matrix. Prove that P J AP is skew-symmetric. 





Using the properties of transpose, we have 

(P J AP) t = P 1 A 1 (P 1 ) 1 = p t a t p 
= P 1 (-A)P = - (P 1 AP). 

This implies that P 1 AP is skew-symmetric. 

(d) Suppose that A is real skew-symmetric. Prove that iA is an Hermitian matrix. 

Note that since A is real, we have A = A. 

Then we have 

(iA) 1 = (iA) 1 = ( - iA) 1 = ( - i)A T = ( - /)( - A) = iA. 

It follows that iA is Hermitian. 

(e) Prove that if AB = - then is a skew-symmetric matrix. 

We calculate 


(^5) t = B 1 A 1 = (-B)(-A) 
= BA = - AB, 


where the last step follows from the assumption AB = - BA. 

This proves that AB is skew-symmetric. 

(f) Let v be an ^-dimensional column vecotor. Prove that v Ay = 0. 

Observe that v T ^v is a 1 x 1 matrix, or just a number. 

So we have 


v T ^v = (v T v4v) T = v T v4 T (v T ) T 

= v T v4 T v = v T ( -A)y= - (v t Av). 


This yields that 2v T v4v = 0, and hence v t Av = 0. 

(g) Suppose that A is a real skew-symmetric matrix and A 2 v = 0 for some vector v E R /? . Then 
prove that Ay = 0 . 

Let us compute the length of the vector Ay. 

We have 


Uvl = (Ay) t (Ay) = y t A t Ay 
= v T ( - A)Ay = -y t A 2 y 
= vO by assumption 

= 0 . 


Since the length L4vll = 0, we conclude that^4v = 0. 


Let v be a vector in an inner product space V over R. Suppose that {ui , ..., u n } is an orthonormal basis of V. 
Let 6i be the angle between v and for i — 1,..., n. Prove that 

cos 2 9 1 +-b cos 2 9 n — 1. 

Definition (Angle between Vectors). 

Let (a, b) denote the inner product of vectors a and b in V. 


Recall that the angle 6 between a and b is defined as the unique number 6 between 0 and n satisfying 


cos# = 


(a,b) 
Hall libII * 


Proof. 

Express the vector v as a linear combination of the basis vectors as 

\ = a 1 u l + - + a n u n 

for some real numbers a l9 .. a n . 

The length of the vector v is given by 


llvll = 


'\fii + •• 


• + a 


2 

rr 


For each i, we have using the properties of the inner product 


(v, U ( ) = (fljUj + - + a„u„,u ( .> 

= fl 1 <u 1 ,u / ) + -» + a fI <u J| ,u f > 


since (u i9 u z ) = 1 and (u ■, u t ) = 0 if / ^ / as {u 1? u^} is orthonormal. 


By definition of the angle, we have 


(v, u ; > <v, u •> 

C ° Se i = llvll llll ( ll = llvll 


since llu-ll = 1. 


It follows that 






COS 2 ^ 1 H- COS 2 0 n 


(v,u t ) 2 <v,u„> 2 

- + ••• + - 

llvll 2 llvll 2 


1 

llvll 2 


, 2 , 2x 

0 1 + - a n) 


1 



2 


1 . 


by (**) 

by (*) 


Thus we obtain 

cos 2 ^ 1 + ••• + cos 2 0 n = 1 


as required. 


Suppose that A is a real n X n matrix. 

(a) Is it true that A must commute with its transpose? 

(b) Suppose that the columns of A (considered as vectors) form an orthonormal set. Is it true that the rows of A 
must also form an orthonormal set? 

(University of California, Berkeley) 


Solution. 

(a) Is it true that A must commute with its transpose? 


The answer is no. 


We give a counterexample. Let 


Then the transpose of A is 




0 

2 


We compute 



1 

-1 

' 1 O' 


' 2 -2' 

aa t = 



= 



.0 

2 . 

.-1 2. 


.-2 4 . 


and 



O' 

'1 

1 

I— 1 
_1 


' 1 

fa' 

2. 

o 

2 . 


-1 

5 . 


Therefore, we see that 


aa t a a t a, 


that is, A does not commute with its transpose A 


T 


(b) Is it true that the rows of A must also form an orthonormal set? 




















The answer is yes. 


Note that in general the column vectors of a matrix M form an orthonormal set if and only if 
M ' t M — I, where I is the identity matrix. (Such a matrix is called orthogonal matrix.) 

Thus, by assumption we have A T A — I. Let B — A T . 

Then the column vectors of B is the row vectors of A. Hence it suffices to show that B T B — I. 

Since A T A — /, we know that A is invertible and the inverse A 1 = A T . 

In particular, we have A T A = AA T = I. 

We have 

b t b = ( a t ) t a t = ( aa t ) t = i T = i. 

Thus, we obtain B^B — I and by the general fact stated above, the column vectors of B form an 
orthonormal set. 

Hence the row column vectors of A form an orthonormal set. 


Recall that a complex matrix is called Hermitian if A* — A, where A* = A . Prove that every Hermitian 
matrix A can be written as the sum A = B + iC, where B is a real symmetric matrix and C is a real skew- 
symmetric matrix. 

Proof. 

Since A is Hermitian, we have 


= A. 


Taking the conjugate of this identity, we also have 


A t — A. 


Let 


B=\(A + A) 

and 

C=±( A -A). 

We claim that B is a real symmetric matrix and C is a real skew-symmetric matrix. 
We have 


B =\(A + A) = \(A + A) = \(A + A) = B. 


Thus, the matrix B is real. To prove B is symmetric, we compute 




B T = i(,l + i) T 
= \^ + a t ) 

= ^ (A + A) by (*) and A is Hermitian 

Zi 

= B. 

This proves that B is symmetric. 

The matrix C is real because we have 

c=^a-A ) =^a-A ) = c. 


We also have 


C T = i(A T -i T ) 

= -j,(A-A) = -C. 


by (*) and A is Hermitian 


Hence C is a skew-symmetric matrix. 

Finally, we compute 

B + iC — — (A + A) + i • — (A — A) 

= \(A + A) + \(A-A) 

= A. 


Therefore, we have obtained the sum as described in the problem. 


(a) Prove that each complex n X n matrix A 
matrices. 

A [ i 

(b) Write the complex matrix A = 

2 x 

matrices. 

Definition (Hermitian matrix). 

Recall that a complex matrix M is said to be Hermitian if M * = M. 
Here A* is the conjugate transpose matrix AT* = M *. 

Proof. 

Let 



can be written as A = B + iC, where B and C are Hermitian 
as a sum A — B + iC, where B and C are Hermitian 

1 + i 






B 


A + A* 
2 


and C 


A- A* 
2 i 


We claim that B and C are Hermitian matrices. 

Using the fact that (^4* )* = A, we compute 

_ A* + (A*)* 

2 


It yields that the matrix B is Hermitian. 


We also have 


C* 


A-A* 
2 i 

A* - ( A *) 



-2 i 

A - A* 
2 i 


C. 


Thus, the matrix C is also Hermitian. 


Finally, note that we have 


„ A + A .A-A* 

B + iC = ---+ i ——— 

2 2 i 

_ A±A? A - A* 


= A. 


Therefore, each complex matrix A can be written as A = B + iC, where B and C are Hermitian 
matrices. 


\item By the proof of part (a), it suffices to compute 


B= A±AL smic= A - A ‘ 


2 i 


We have 


A* = 


—i 2 + i 
6 1 — i 


A direct computation yields that 





















1 


and C = 


1 


-1+2 i 



By the result of part (a), these matrices are Hermitian and satisfy A = B + iC, as required. 


Let a, b be vectors in M n . Prove the Cauchy-Schwarz inequality: |a • b| < ||a|| ||b 

Proof 1 

Let a: be a variable and consider the length of the vector a — xh as follows. 
We have 


0 < ||a — a:b|| 2 = (a — xb) • (a — xb) 

= a a — a xb — a;a b + x 2 b • b 

= ||b|| 2 x 2 — 2a • hx + ||a|| 2 . (*) 


Note that the last expression is an equation of a parabola (quadratic equation). 

Since the parabola is always non-negative, its discriminant D must be non-positive. 

Hence we have 


£>= (2a-b) 2 - 4||a|| 2 ||b|| 2 < 0. 

It follows that we have 

(a-b) 2 < ||a|| 2 ||b|| 2 . 

Taking the square root, we obtain the Cauchy-Schwarz inequality 

|a • b| < ||a|| ||b||. 

Proof2 

The second proof starts with the same argument as the first proof. 

As in Proof 1 (*), we obtain 

0 < ||b|| 2 x 2 — 2a • hx + IIall 2 . 


Now we take 


a • b 



Then we have 







0 < ||b|| 2 


a • b 

llbll 2 


2a • b 


a • b 

llbll 2 


(a • b) 2 (a ■ b) 2 

llbll 2 ||b|| 2 


(a • b ) 2 

llbll 2 



2 



It follows that we have 


(a-b) 2 < ||a|| 2 ||b|| 2 . 

The Cauchy-Schwarz inequality is obtained by taking the square root as in Proof 1. 


<& Let A \, A2 ,..., A m be n X n Hermitian matrices. Show that if 

Af + A^ + • • • + — O , 

where O is the n X n zero matrix, then we have A{ — O for each i — 1, 2, ..., m. 

Hint. 

Recall that a complex matrix A is Hermitian if the conjugate transpose of A is A itself. 
Namely, A is Hermitian if 

A T = A. 

We also use the length of a vector in the proof below. 

Let v be ^-dimensional complex vector. Then the length of v is defined to be 

llvll = 

The length of a complex vector v is a non-negative real number. 

The length is also called norm or magnitude. 

Proof. 

Let x be an a- dimensional vector, that is, x 6 R n . 

Then for each /, we have 

x T ^x = x T ^^ z -x = ( A i x) T (A i x ) = u.xl 2 > 0. 


Here, the first equality follows from the definition of a Hermitian matrix. 


Now we compute 


0 = x T e>x = \ t {a\+A 1 1 + ••• +A 1 m )\ 
= x T 4x + x T ^ 2 x + -+x T 4x 
= U,xl 2 + L4,xl 2 + - + L4 xll 2 . 

1 z m 







Since each length \\A -xll is a non-negative real number, this implies that we have A-x = 0 for all x 6 R n 
. Hence we must have A - = O for each i = 1,2,.. m. 


0 Find the inverse matrix of the matrix 

~ 2 1 i " 

7 7 7 

^ _ 6 . 1 _ 3 

7 7 7 

1 6 . _1 

_ 7 7 7 _ 

Hint. 

You may use the augmented matrix method to find the inverse matrix. 

Here we give an alternative way to find the inverse matrix by noting that A is an orthogonal matrix. 
Recall that a matrix B is orthogonal if B B = B B = I. 

Thus, once we know B is an orthogonal matrix, then the inverse matrix B ~ 1 is just the transpose 
matrix i? . 

Also, recall that a matrix B is orthogonal if and only if the column vectors of B form an orthonormal 
set. 

Solution. 

We first show that A is an orthogonal matrix. 

To do this, it suffices to that the column vectors form an orthonormal set. 

Let 



be the column vectors of A. 

Then the length of the vector Vj is 

ijvjl I = A /(2/7) 2 + (6/7) 2 + (- 3/7) 2 = 1. 

Similarly, we have | | v 2 | | = | | v 3 | | — 1. 

Thus, column vectors are unit vectors. 

The dot (inner) product of the vectors v 1 and v 2 is 











Similarly, we have 


2 3 6 2 / 3 \ 

7 ‘ 7 + 7 ' 7 + ("7 j 



Vj • v 3 M| 0, v 2 • v 3 = 0. 


Therefore, the column vectors are orthogonal. 

Hence the column vectors of A are orthonormal, and this implies that A is an orthogonal matrix. 
Namely, ,4 T =A~\ 

Thus the inverse matrix of A is 


2 

7 


6 

7 


3 

7 


A~ l = 


3 2 
7 7 



6 

7 


2 

7 


0 A complex square (n X n) matrix A is called normal if 

A* A = AA*, 

-T 

where A * denotes the conjugate transpose of A, that is A* = A . 

A matrix A is said to be nilpotent if there exists a positive integer k such that A k is the zero matrix. 

(a) Prove that if A is both normal and nilpotent, then A is the zero matrix. You may use the fact that every normal 
matrix is diagonalizable. 

(b) Give a proof of (a) without referring to eigenvalues and diagonalization. 

(c) Let A,BbQn X n complex matrices. Prove that if A is normal and B is nilpotent such that A + B — /, 
then A — /, where / is the n X n identity matrix. 

Proof. 

(a) If A is normal and nilpotent, then A = O 

Since A is normal, it is diagonalizable. Thus there exists an invertible matrix P such that P~ l AP = D , 
where D is a diagonal matrix whose diagonal entries are eigenvalues ofv4. 

Since A is nilpotent, all the eigenvalues of A are 0. (See the post “Nilpotent matrix and eigenvalues of 
the matrix” for the proof.) 

Hence the diagonal entries of D are zero, and we have D = O, the zero matrix. 

It follows that we have 


A =PDP~ l =POP~ l = O. 


Therefore, every normal nilpotent matrix must be a zero matrix. 


(b) Give a proof of (a) without referring to eigenvalues and diagonalization. 





Since A is nilpotent, there exists a positive integer k such that A k = O. 
We prove by induction on k that A = O. 

The base case k = 1 is trivial. 


Suppose k > 1 and the case k— 1 holds. Let B = A k ~ l . Note that since A is normal, the matrix B is 
also normal. 

For any vector x E C n , we compute the length of the vector B *Bx as follows. 

II B * Bx II = (B* Bx) * (B * i?x) by definition of the length 

= x*B*(BB*)Bx 

= x*B*(B* B)Bx since B is normal 

= x*(B*) 2 B 2 x, 

and the last expression is O since B 2 = A 2k ~ 2 = O as k > 2 implies 2k - 2 > k. 

Hence we have B*Bx = 0 for every x E C n . 


This yields that 


WBxW = (Bx)*(Bx) 

= x*B*Bx = 0, 


for every x E C n , and hence B = O. 

By the induction hypothesis, A k ~ 1 = O implies A = O, and the induction is completed. 

So the matrix A must be the zero matrix. 

(c) If A is normal and B is nilpotent such that A + B = /, then A = I 

We claim that the matrix B is normal as well. If this claim is proved, then part (a) yields that B = O 
since B is a nilpotent normal matrix, which implies the result A = I. 

To prove the claim, we compute 

B*B = (I-A)*(I-A) 

= (!-A*)(I-A) 

= I-A-A* +A*A, 


BB* = (I-A)(I-A)* 

= (I-A)(I-A*) 

= I — A* -A+AA* 

= / - A* ~A+A*A since A is normal. 

It follows that we have B*B = BB*, and thus B is normal. 


Hence, the claim is proved. 









Eigenvalues and Eigenvectors of Linear Transformations 


Definition 


Let T : V —> V be a linear transformation from a vector space V to itself. 

We say that A is an eigenvalue of T if there exists a nonzero vector v £ V such that T(v) = Av. 

Q For each eigenvalue A of T, nonzero vectors v satisfying T(v) = Av is called eigenvectors 
corresponding to A. 


Problems 


^ Let T be the linear transformation from the vector space R 2 to R 2 itself given by 

X\ \ _ 3xi + X2 

_x 2 \J lxi+3x 2 _ 


(a) Verify that the vectors 


Vi = 


1 

-1 


and v 2 — 


are eigenvectors of the linear transformation T, and conclude that B — {vi, v 2 } is a basis of R 2 consisting of 
eigenvectors. 

(b) Find the matrix of T with respect to the basis B — {vi, v 2 }. 


Solution. 


(a) B = {Vi, V 2 } is a basis of M 2 consisting of eigenvectors 


We compute that 


T(vi) =T 


1 

-1 


2 

-2 


= 2 


1 

-1 


= 2vi 


and 


T(v 2 )=T 




= 4v 2 . 


Thus, is an eigenvector corresponding to the eigenvalue 2 and v 2 is an eigenvector corresponding 
to the eigenvalue 4. 

Since , v 2 are eigenvectors corresponding to distinct eigenvalues, they are linearly independent, 
and thus B = { Vi , v 2 } is a basis of R 2 . 


(b) Find the matrix of T with respect to the basis B 


From the computation in part (a), we have 































T(v 1 ) = 2vi + 0 v 2 
T(v 2 ) = Ovi + 4v 2 . 

Hence the coordinate vectors of T( Vi), T (V 2 ) with respect to the basis B — {Vi, v 2 } is a basis 
ofM 2 are 

EHvOIj, = 

Thus the matrix A of the linear transformation T with respect to the basis B is 

J 4=[[T(v,)] i ,,[T(v 2 )lB]= \l ° . 


,[T(v 2 )] b = 


Let Pi be the vector space of all real polynomials of degree 1 or less. Consider the linear transformation 
T : Pi —> Pi defined by T(ax + b) = (3 a + b)x + a + 3 for any ax + b £ Pi. 

(a) With respect to the basis B — {1, cc}, find the matrix of the linear transformation T. 

(b) Find a basis B' of the vector space Pi such that the matrix of T with respect to B' is a diagonal matrix. 

(c) Express f(x) =5#-f3asa linear combination of basis vectors of B f . 


(a) Find the matrix of the linear transformation T with respect to the basis B = (i,4 • 
The matrix A for T with respect to the basis B — { 1 , x} is given by 

^ = [[r(i)] s ,[r(x)] s ]. 

Since we have 


7(1) = x + 3 and T (x) = 3x + 1, 
the coordinate vectors of these vector with respect to B are 


[T(1)]b = 


and [r(x)] g = 



Therefore, the matrix A for T with respect to B is 

^=[ 3 

_1 

(b) Find a basis B 1 of the vector space P\ such that the matrix of T with respect to B 1 is a diagonal 
matrix. 

We first diagonalize the matrix A obtained in (a). 

Solving the characteristic polynomial 

3 -t 1 
1 3 -t _ 

= t 2 -&t + % = {t- 2 )(t - 4), 


Pa (t) = det(4 — tl) 


1 

3 


















the eigenvalues are 2 and 4. Since A has two distinct eigenvalues, it is diagonalizable. 


We find the eigenvectors corresponding to the eigenvalue 2. 
The eigenvectors are nonzero solutions of (A — 2/)x = 0 . 

1 


From this, we find that 
Similarly, we find that 

Thus, if we set 


-1 

1 

1 


is an eigenvector, 
is an eigenvector corresponding to the eigenvalue 4. 


then we have the diagonalization of A : 


P = 


2 0 
0 4 


1 1 

-1 1 


= p~ l ap. 


It follows that 


2 0 
0 4 


is the matrix representation for T with respect to the basis 

B' = {vi,v 2 }, 


where 


vi g = 


and [v 2 ]b = 


and P is the transition matrix (change of basis) from B 1 to B . 

(Note that 

[vi]g =P[vi\ B ' =P 

Similarly for [v 2 ]b •) 

Hence, v i = —x + 1 and y 2 = x + 1 form the basis B f = {v i, v 2 } with respect to which the 

2 O' 


1' 


' 1 ' 

.0. 


-1 

T—1 

1 


matrix for T is the diagonal matrix 


0 4 


(c) Express f \x) = 5x + 3 as a linear combination of basis vectors of B 1 . 

Note that since P is the transition matrix from B f to B , we have the following relation between coordinate 
vectors. For any v E Pi , we have 


P[v]g, = [v]g. 


Using this, we have 


w)\b' =p- L m\ 


i -i 

i i 




























Hence we obtain the linear combination of f{x) with v i, V 2 : 

f(x) = (-l)vi +4v 2 

(You may confirm that 5x + 3 = — (— x + 1) + 4(x + 1) .) 

Let V be a real vector space of all real sequences = (a\ , a 2 ,...). Let U be a subspace of V defined by 

u = {(a*)“i G V I a n+ 2 = 2a n+ i + 3a n for n = 1,2,...}. 

Let T be the linear transformation from U to U defined by 

T((a 1 ,a 2 ,...)) = (a 2 , a 3 ,...). 

(a) Find the eigenvalues and eigenvectors of the linear transformation T. 

(b) Use the result of (a), find a sequence satisfying a\ —2^2 = 7. 


(a) Find the eigenvalues and eigenvectors of the linear transformation T. 

Note that each sequence (a z - )?? 1 in U is determined by the first two numbers ai, <22 since the rests can be 
obtained from the linear recurrence relation a n +2 = + 3 a n . 

Thus, for example, we take (a 1 , CI2) — (1, 0), (0,1) and obtain a basis B — {ui , U 2 } of U, where 

m = (1,0,3,6,21,..-) 
u 2 = (0,1,2,7,20,...). 

We find the matrix A of T with respect to the basis B . 

We have 

7(ui) = (0,3,6,21,* • •) = 0ui + 3 u 2 
T( u 2 ) = (1,2,7,20,...) = U!+2 u 2 . 

Thus, the matrix A for T is 



Then the eigenvalues of T are eigenvalues of A . 

The characteristic polynomial for A is 

Pa (0 = det(^4 — tl) — t 2 — 2t — 3 = (t + 1)(^ — 3). 

Thus the eigenvalues are t — —1,3. 

Let us find eigenvectors corresponding to t = — 1 . 

We solve 

T((ai,a 2 ,...)) = -(ai,a 2 ,...), 

or equivalently, 

(a 2 ,a 3 ,...) = —(a\,a 2 ,...). 





This yields the relation 


^n+l — 1 


fox n — 1,2,.... 

Hence the sequence ((2 Z - )^ 1 is a geometric progression with initial value a\ and ratio —1. 

Therefore, the sequence (a* )^ 1 such that 

at = (—iy _1 ai 

are eigenvectors corresponding to the eigenvalue — 1. 

Similarly, eigenvectors corresponding to the eigenvalue 3 are geometric progression with initial value a\ 
and ratio 3, thus 

cii — 3 /-1 ai. 


(b) Find a sequence satisfying a\ =2,(22 = 7. 

From part (a), we see that sequences 

Vi = ((-ly- 1 )^ and v 2 = 

form a basis of U consisting of eigenvectors of T. (We chose a\ — 1 .) 

Hence any sequence (a* )?^ 1 can be written as a linear combination of these two vectors: 

(«/)“ 1 = CiVi + C 2 V 2 

= ci((-iy- 1 )~ 1 +c 2 (3 / - 1 )~ 1 


for some Ci, C 2 . 

If the sequence satisfies the initial condition a\ — 2, a 2 = 7, then we have 

2 = a\ = ci + c 2 

7 — ci2 — —ci + 3 c 2 . 

Solving this, we obtain Ci = —1/4 and C 2 = 9/4. 

Hence we have 


In summary, the sequence (a z )^ 1 satisfying the linear recurrence relation a n+ 2 = 2a w+ i + 3(2^ with 
initial conditions a 1 = 2,(22 — 7 is 


= i((-l)'+3'+>)“. 

Check out the following problems about how to solve a linear recurrence relation using the vector space 
techniques, like the current problem. 

Let T : R 2 —> R 2 be a linear transformation and let A be the matrix representation of T with respect to the 
standard basis of R 2 . Prove that the following two statements are equivalent. 

(a) There are exactly two distinct lines L \, L 2 in R 2 passing through the origin that are mapped onto themselves: 

T{L 1 ) = L 1 and T(L 2 ) = L 2 . 

(b) The matrix A has two distinct nonzero real eigenvalues. 


Proof. 

(a) =*► (b). 

Suppose that the statement (a) holds. 

That is, there are lines L\,L 2 in R 2 passing through the origin that are mapped onto themselves, and no 
other lines passing through the origin are mapped onto themselves. 

A line passing through the origin in R 2 is a one-dimensional subspace in R 2 . Thus, it is spanned by a 
single nonzero vector. 

We have 


L\ — Span(vi) and A 2 = Span(v 2 ) 
for some nonzero vectors v 1 , v 2 • 


Since T (L 1 ) — L 1 , we have Ay \ £ L 1 = Span(v 1 ) . 
Hence there exists X \ E R such that 


Ay 1 = X\Y\. 

Since v 1 is a nonzero vector, this means that X\ is an eigenvalue of A . 


We claim that X 1 7 ^ 0. Otherwise, we have Ay 1 — 0 . 

Any vector in A 1 is of the form v = cly\ for some a £ R. So we have 

Ay — A(avi) = aAv 1 = 0, 

and this yields that T(L 1 ) = {0},a contradiction. 

Hence X \ is a nonzero real eigenvalue of A . 

By the same argument, there is a nonzero real eigenvalue X 2 such thatv 4 v 2 — X 2 y 2 . 


It remains to show that X \ 7 ^ X 2 . 

Assume on the contrary that X X \ — X 2 . 



2 

Since v 1 , V 2 are a basis of two distinct lines, they form a basis of R . 
Hence any vector v G R 2 can be written as a linear combination 


v = 


av i + bv 2 


for some a,b G R. 


Then we have 


Ay — A(av i + &V 2 ) 
= av4vi + bAv 2 
= aAvi + Mv2 
= A(avi + Z>v 2 ) 


= Xy. 


This implies that any line spanned by a nonzero vector v is mapped onto itself by the linear transformation 
T. 

This contradicts our assumption that L i, L 2 are the only such lines. 

Therefore, X \ 7 ^ X 2 , and we have proved that A has two distinct nonzero real eigenvalues. 



(a). 


Now we suppose that the statement (b) holds. 

Namely, we suppose that there are two distinct nonzero real eigenvalues of A . Let us call them X \, X 2 and 
let v 1 , v 2 be eigenvectors corresponding to X \, X 2 , respectively. 

In general eigenvectors corresponding to distinct eigenvalues are linearly independent. 

Thus, v 1 , v 2 are linearly independent. 

Hence the lines L \, L 2 spanned by v 1 , V 2 are distinct. 

Each vector on L 1 is of the form v = a v 1 for some a G R . Hence we have 

Ay = A(av 1 ) = aAv 1 = aX\Vi = Xy. 

It follows that each vector v G L 1 is mapped onto Z 1 . 

(Since X 7 ^ 0, the image under T is one-dimensional, hence T(L 1 ) = L \.) 

Similarly, we have T(L 2 ) — L 2 . 

Finally, we show that no other lines passing through the origin are mapped onto themselves by T . 

Assume that we have a line L — Span(w) such that T(L) = L . 

Then since we have Aw G L = Span(w) , there is /i G R such that 


Aw = //w. 


(*) 


r 'i 2 

Since {v 1 , v 2 } is a basis of R , we have 


w = av\ + bY 2 


for some a, b £ R. 

Then the left hand side of (*) becomes 


Aw = A(av i + Z?v 2 ) 

= <x4vi + bAv 2 
= aAivi + M 2V2. 


Thus the relation (*) yields that 


a/livi + M 2V2 = //avi + //Z?V 2 

tf(^i -ju)vi + Z>(/L 2 -/0 V 2 = 0- 


Since vr , v 2 are linearly independent, we have 

a(A 1 — //) = 0 and 6(A 2 — ju) = 0. 

The equality a (A 1 — //) = 0 implies that either a — 0 or A 1 — fi. 
lfa — 0, then we have w — bv 2 , and hence L — L 2 . 

If// = A 1 , then the second equality 6(A 2 — //) = 0 implies that b — 0 since /l 1 /l 2 • 
Then we have w = av 1 , and hence L — L 1 . 

In either case, Z must be Zi or . 

This completes the proof. 


Let y be a real vector space of all real sequences = (&i, a 2 ,...). 

Let U be the subspace of V consisting of all real sequences that satisfy the linear recurrence relation 
a,k +2 — 5a & + 1 + 3a& — 0 for k — 1,2,.... Let T be the linear transformation from U to U defined by 

T((oi,a 2 ,...)) = (a 2 ,a 3 ,...). 

Let B — {ui, U 2 } be a basis of U, where 

m = (1,0, -3,-15, -66,...) 
u 2 = (0,1,5,22,95,...). 

Let A be the matrix representation of the linear transformation T : U — >• U with respect to the basis B. 

(a) Find the eigenvalues and eigenvectors of T. 

(b) Use the result of (a), find a sequence (a^)^ 1 satisfying the linear recurrence relation 
a k+2 ~ 5a ^ + 1 + 3 aj~ — 0 and the initial condition a\ — 1, a 2 = 1. 

(c) Find the formula for the sequences (a^)^ satisfying the linear recurrence relation 
a k+2 ~ 5a& + i + 3a& — 0 and express it using a\ , a 2 - 


Solution. 


(a) Find the eigenvalues and eigenvectors of the linear transformation T . 

Note that each sequence (a z - )?? 1 in U is determined by the first two numbers a\ , a 2 since the rests can 
obtained from the linear recurrence relation a n +2 — + 3 a n . 

Thus, for example, we take (a 1 , ) = (1, 0), (0,1) and obtain a basis B — {ui , u 2 } of U, where 


Ul = (1,0,3,6,21,..-) 
u 2 = (0,1,2,7,20,...). 

We find the matrix A of T with respect to the basis B . 

We have 

T(ui) = (0,3,6,21,- • •) = Oui + 3 u 2 
r(u 2 ) = (1,2,7,20,...) = U! +2u 2 . 

Thus, the matrix A for T is 

A=\° X l. 

[3 2 J 

Then the eigenvalues of T are eigenvalues of A . 

The characteristic polynomial for A is 

Pa if) — det(v4 — tl) — t 2 — 2t — 3 = (t + l)(t — 3). 


Thus the eigenvalues are t — —1,3. 

Let us find eigenvectors corresponding to t = — 1. 

We solve 

T ((ai,a 2 ,• • •)) = -(ai,a 2 ,. • •), 

or equivalently, 

(« 2 ,«3, • • •) = ~(ai,a 2 ,. ■ •)• 


This yields the relation 


^n+l — &n 


for n — 1, 2 ,... . 

Hence the sequence (a z - )^ 1 is a geometric progression with initial value a\ and ratio —1. 

Therefore, the sequence (ai )^ 1 such that 

a-i = 

are eigenvectors corresponding to the eigenvalue — 1 . 

Similarly, eigenvectors corresponding to the eigenvalue 3 are geometric progression with initial value CL\ 
and ratio 3, thus 

3 i — 1 

a l- 

(b) Find a sequence satisfying a\ =2,^2 =7. 

From part (a), we see that sequences 




vi = ((-l y 1 )Z 1 andv 2 = (3 1 ' 

form a basis of U consisting of eigenvectors of T. (We chose 0\ — 1.) 

Hence any sequence {o\ )^ 1 can be written as a linear combination of these two vectors: 

(«/)“ 1 = CiVi + C 2 V 2 

= ci((-l) i - 1 )” 1 +c 2 (3 i - 1 )”i 

for some c \, C 2 • 

If the sequence satisfies the initial condition 0\ — 2,0,2 — 7, then we have 

2 = a\ — ci + C 2 

7 — 02 — —c\ + 3c‘2 . 

Solving this, we obtain C\ — — 1/4 and C 2 = 9/4. 

Hence we have 

= i((-i>')«+ *(*“>& 

In summary, the sequence (a z -)^ 1 satisfying the linear recurrence relation a n +2 — 2a^ + i + 3 o n with 
initial conditions a 1 = 2, a 2 — 7 is 

(<■<)& = ^((-1)' + 3 !+1 )~,. 


Jordan Canonical Form 


Problems 

Let A be an n X n matrix such that tr(A n ) = 0 for all n E N. Then prove that A is a nilpotent matrix. Namely 
there exist a positive integer m such that A m is the zero matrix. 


Steps. 

Use the Jordan canonical form of the matrix A . 


Q We want to show that all eigenvalues are zero. (Review Nilpotent matrix and eigenvalues of the 
matrix) 


Seeking a contradiction, assume some eigenvalues are not zero. 


Using tr (A n ) = 0, create a system of linear equations. 


Calculate the determinant of the coefficient matrix of the system using the Vandermonde matrix 
formula. 


Find a contradiction. 


Proof. 

We first want to prove that all the eigenvalues of A must be zero. 

Seeking a contradiction, assume that some of the eigenvalues of A are not zero. 

So assume that A / , i — 1, ..., r are distinct nonzero eigenvalues of A and each m / > 1 is a multiplicity 
of A i. 

We use the Jordan canonical form of the matrix A . 

There exists an invertible matrix S such that S -1 AS — T , where T is an upper triangular matrix.The 
diagonal entries are eigenvalues of A . 

Then we have for any positive integer n , 











0 = ti(A n ) = ti((STS ~ l ) n ) = tr (ST n S ~ l ) = tv(T n ) 

— m \A\ TYl 2 A 2 TTl r^r • 

Note that since T is an upper triangular matrix, the nonzero diagonal entries of T n are A appearing m / 
times. 

Changing n from 1 to r, we obtain the system of linear equation. (Think m / as variables.) 


Wl i A J T - TVl 2 A 2 T ' ' 

• • T - YYl r A \ 

m \A\ + m 2 A\ + • ■ 

1 • -)- YYl r A r 

III \A ^ T" YYl 2A 2 T - • 

• • YYl r A r 


Equivalently, we have the matrix equation 


A\ A 2 • •• A} 


mi 


'o' 

A\ A 2 2 ... 


m 2 

= 

0 

2 r ... u 

/l /L 2 Ao y 


_ m r _ 


.0. 


Let B denote the matrix above whose entries are powers of eigenvalues A ;. 
We calculate the determinant of B . 


det(i?) — A\A 2 • • % A y 


1 


1 

A 2 


t r—1 i r— 1 

/I 1 a 2 


2 r— 1 

/l 


= /l l/l 2 • • fj (2.J- — Ai) ytz 0. 

l<i<j<n 


(*) 


(Note that the matrix above is a Vandermonde matrix.) 

Thus the matrix B is invertible. This means that the matrix equation (*) has unique solution 


til 1 


'O' 



0 




_ III Y _ 


.0. 


But this is a contradiction because each multiplicity m / is grater than zero. 

Thus we proved that all eigenvalues ofv4 are zero. 

Recall that a matrix is nilpotent if and only if its eigenvalues are zero. 

See the post i 

Nilpotent Matrix and Eigenvalues of the Matrix 
for a proof of this fact. 

Hence it follows from this fact that A is a nilpotent matrix. 














0 Let ibenxn matrix and let Ai, A 2 ,..., A n be all the eigenvalues of A. (Some of them may be the same.) 
For each positive integer k, prove that Aj, Ag,..., A n are all the eigenvalues of A k . 

By the triangularization (or Jordan canonical form), there exists a nonsingular matrix S such that 


S~ l AS 


Ai * * * * 

0 A 2 * * * 


0 0 0 A n _i * 
0 0 0 0 A n 


Here the right matrix is an upper triangular matrix whose diagonal entries are eigenvalues of A. 


Then we have 


S- l A k S = (ST 1 AS)* = 


\ K v|> 

/\ ‘T* “T* 

0 A| * * * 

0 0 0 Atx * 

0 0 0 0 x k 


The characteristic polynomial of the matrix A k is given by 

p(t ) = det(A fc — tl ) 

= det(S _1 )det(A* - tl)det(S) 

= det (S-\A k -tI)S) 

= det(S~ 1 A k S - tl) 

\\—t * * 

0 A| — t * 

0 0 0 

0 0 0 

=nw - *>■ 

i =1 

Since the roots of the characteristic polynomial are all the eigenvalues, we see that A^, • • •, A 
are all the eigenvalues of A k . 


X n -1 - * * 

0 Xn - t 


Let A be an n X n matrix and suppose that A r = I n for some positive integer r. Then show that: 

(a) |tr(A)| < n. 

(b) If \tr(A) | = n, then A = (I n for an r-th root of unity 

(c) tr( A) — n if and only if A — I n . 


Proof. 











(a) |tr(^)| < n . 

We fist show that the eigenvalues of A are r-th roots of unity. 

Let A be an eigenvalue of A and let x be an eigenvector corresponding to A . That is, Ax — Ax . 
Multiplying this equality by A on the left, we see that 

A 2 x — AAx — A 2 x. 

Again multiplying this equality by A on the left we getv4 3 x = A 3 x . 

Inductively, we obtain A r x — A r x . 

Since A r — I n , we get x = A r x , Thus (A r — l)x = 0 and since x is nonzero vector (as it is an 
eigenvector), we have A r — 1. Therefore eigenvalues are r-th roots of unity. 


Next, we consider the Jordan canonical form of A . There exists an invertible matrix P such that P ~ l AP 
is an upper triangular matrix T — (t ij ) whose diagonal entries are eigenvalues of A . 

Since the trace of A is equal to the trace of P ~ l AP , we see that tr (A) is a sum of n r-th roots of unity. 
Thus 


|trMI = lI>l<£M=»- (*) 

i =1 i =1 

(b) If | tr (A ) | = n , then A = Q n for an r-th root of unity C 
We first refine the proof of part (a). 

Note that the matrix A is a solution of the equation x r — 1, 

So the minimal polynomial of A divides x r — 1 , thus the minimal polynomial has no repeated roots. 
Thereby the Jordan canonical form T is actually a diagonal matrix. 

Now we start the proof of (b). The inequality in (*) becomes the equality if and only if all the roots of unity 
tu are the same root of unity 

Then the Jordan canonical form is T — £I n , hence A — £I n . 

(c) tr(yi) = n ifandonlytfT4 —I n 

By part (b), we have A — £I n for some r-th root of unity C but then n — tv(A) — implies £ — 1 . 
Thus A — I n as required. 

Let A be an n X n matrix such that A k = I n , where k E N and I n is the n X n identity matrix. Show that the 
trace of ( A~ 1 ) T is the conjugate of the trace of A. That is, show that tr((A -1 ) T ) = tr (A). 


Hint. 


Note that tr(5) = tr(5 T ) for any square matrix B . 




0 Use the Jordan canonical form of A . 


0 Show that eigenvalues are k-th roots of unity. 


Proof. 

First note that the trace of a matrix is the same as the trace of its transpose. Thus we only have to show 

tr(A _1 ) = tr(^). 

There is an invertible matrix P such that P 1 A P is the Jordan canonical form. That is, P 1 A P — T , 
where T is an upper triangular matrix whose diagonal entries are eigenvalues of A . 

Since A is invertible (to see this take the determinant of A k — I n ), the matrix T is also invertible and 
P~ l A ~ l P = T ~ l . Then we have 

tr (A) =tr (P^AP) = tr (T) 
tr(A -1 ) = tr (P^A^P) = tr(r _1 ) 


Now let X i , X 2 , • • •, X n be eigenvalues of A . Then the upper triangular matrix T and its inverse matrix 
are 



"Ai 

* 

* 

* 


’^r 1 

* 

* * 

T = 

0 

N 

to 

* 

* 

, T 1 = 

0 

^•2 1 

* * 


0 

0 




0 

0 



. 0 

0 


X n _ 


0 

0 

; -1 

... A n J 


Thus we have 

tr(^ 4 ) = A i X 2 + • • • + X n 
tr (A ) "T X 2 i X n ^ 

Next, we show that X T 1 = X / for i — 1,2. 

This follows from the fact that ifv4 k — I n then the eigenvalues are &-th roots of unity. 
Assuming this, we have 1 = \X i \ — X[X /, hence A r 1 — Xi . 


To prove the fact, let X be an eigenvalue of A and let x be an eigenvector corresponding to X . 
Then we have Ax — Xx . Using this relation successively we have 

x = I n x — A k x — XA k ~ 1 x = X 2 A k ~ 2 x — • • • = X k x 


Since x is a nonzero vector, we have X 


k 


— 1, and X is a &-th root of unity. 








Now we have 


tr (A — ^ 1 ^ H - A 2 i H - * * • H - A 72 ^ 
= ^ 1 + ^2 + * * * + A n 
— A i + A 2 + *** + A n 
= tr(A). 


This completes the proof. 


0 


(a) Does there exist a 2 X 2 matrix j 4 with A 3 = O but A~ ^ O? Here O denotes the 2x2 zero matrix. 

(b) Does there exist a 3 X 3 real matrix B such that B 2 = A where 


A = 


1 

-1 

0 


-1 0 " 

2 -1 ? 

-1 1 


(Princeton University) 


Hint. 

$ (a) Consider the eigenvalues and the Jordan canonical form of A 


(b) Diagonalize the matrix A 


Solution. 

(a) Does there exist a 2 X 2 matrix^ with A 3 = O but A 2 ^ O? 

We claim that there is no such 2x2 matrix A . 

Suppose that we have A 3 = O. This implies that all the eigenvalues ofv4 are zero. 

To see this, let A be an eigenvalue of A and let x be a corresponding eigenvector. Then applying the 
defining relation Ax = Ax successively, we have 

= A 3 x — AA 2 x — A 2 Ax = A 3 x. 


Thus, the eigenvalue A must be zero since x is a nonzero vector since it is an eigenvector. 
Therefore, there exists an invertible matrix P such that the Jordan canonical form of A is 
0 a 


P ~ l AP = 
Thenx4 = P 


0 0 . 
0 a 

0 0 


for some number a. 


and we have 


A 2 = 


0 a 
0 0 


,-i 


0 a 
0 0 


, -l 


= P 


0 a 
0 0 


l 2 


, -1 






















Hence A 2 must be zero, and we conclude that there is no 2 X 2 matrix A satisfying both A 3 = O and 
A 2 + O. 


(b) Does there exist a 3 X 3 real matrix B such that B 2 = A? 

The answer is yes and we will construct such a matrix B as follows. 

The characteristic polynomial p{t) of the matrix A is 

p(t ) := det(^4 — tl) — —t(t — l)(t — 3). 

Thus the eigenvalues are A — 0,1, 3 . 

The corresponding eigenvalues are obtained by solving the system (A — AI)x = 0 and we see that 


" 1 " 


' 1 " 


" 1 ' 

1 

5 

0 

5 

1 

to 

_ 1 _ 


.- 1 . 


. 1 . 


are eigenvectors corresponding to eigenvalues 0,1, 3, respectively. 
Thus the matrix 


P = 


111 
10-2 
1 -1 1 


is invertible and diagonalize the matrix A such that 


P ~ l AP = 


0 0 0 
0 10 
0 0 3 


Then if we have the equality B 2 — A , then we need to have 
(. P~ 1 BP)(P~ 1 BP ) = p- x AP = 


0 0 0 
0 10 
0 0 3 


From this we see that the matrix 


B = P 


0 0 O' 
0 10 
.0 0 vT 


i-l 


satisfies the above relation. 
In fact, we compute 
















B 2 = 


0 0 0 
0 10 
.o o Vs_ 



0 0 0 
0 10 
.o o Vs. 


,-1 



"0 

0 

0 ' 

2 

'0 

0 

o' 

p 

0 

1 

0 

p~ l =p 

0 

1 

0 


.0 

0 

vT 


.0 

0 

3. 


P~ l = A. 


Remark that the matrix B is real since P is real. Thus the matrix B satisfies the conditions of the problem. 
Remark. 

The explicit form of the matrix P was used to assure that the matrix B is real. 

So we did not have to compute the matrix P explicitly if we use the fact that the symmetric matrix is 
diagonalizable by a real orthogonal matrix. 

Generalization. 

See problem A square root matrix of a symmetric matrix with non-negative eigenvalues for a more general 
question than part (b). 

Furthermore, try the next problem. 


Problem. 

Prove that a positive definite matrix has a unique positive definite square root. 


For a solution of this problem, see the post 

A Positive Definite Matrix Has a Unique Positive Definite Square Root 











